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ABSTRACT 

Determining the scaling relations between galaxy cluster observables requires large samples of uni- 
formly observed clusters. We measure the mean X-ray luminosity-optical richness (L X-N200) relation 
for an approximately volume-limited sample of more than 17,000 optically-selected clusters from the 
maxBCG catalog spanning the redshift range 0.1 < z < 0.3. By stacking the X-ray emission from 
many clusters using ROSAT All-Sky Survey data, we are able to measure mean X-ray luminosities to 
~10% (including systematic errors) for clusters in nine independent optical richness bins. In addition, 
we are able to crudely measure individual X-ray emission from ~ 800 of the richest clusters. Assuming 
a log-normal form for the scatter in the L1X—N200 relation, we measure a\ n L — 0.86±0.03 at fixed A^oo- 
This scatter is large enough to significantly bias the mean stacked relation. The corrected median 
relation can be parameterized by Lx — e"(A2oo/40)^ 10 42 h~ 2 ergss -1 , where a = 3.57 ± 0.08 and 
P = 1.82 ± 0.05. We find that X-ray selected clusters are significantly brighter than optically-selected 
clusters at a given optical richness. This selection bias explains the apparently X-ray under luminous 
nature of optically-selected cluster catalogs. 
Subject headings: galaxies: clusters - X-rays: galaxies: clusters 



1. INTRODUCTION 

Galaxy clusters, as the largest peaks in the cosmic den- 
sity field, play an important role in astrophysics and cos- 
mology (e.g. Borgani 2003; Voit 2005). Structure for- 
mation theory, realized in large scale N-body simula- 
tions, makes robust predictions for cluster space den- 
sity and clustering within various cosmological models. 
Since clusters are also the most observationally accessible 
features of large scale structure, they provide an oppor- 
tunity to place strong constraints on both cosmological 
parameters and the growth of structure. The great chal- 
lenge of cluster cosmology lies in confidently relating the 
dark matter halos we can robustly predict to the bary- 
onic structures we observe. Substantial work is being 
done to close the gap between theory and observations 
from both sides. 

On the theory side, numerical simulations of ever in- 
creasing complexity and resolution provide new insights 
into the evolution of baryons within clusters, and to elu- 
cidate the connection between cluster galaxies and dark 
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matter substructure (Nagai & Kravtsov 2005; Conroy 
et al. 2006). Observationally, much work is being done to 
assemble large samples of clusters detected and observed 
in a wide variety of ways. This is possible because galaxy 
clusters provide a rich suite of observables. Optical light 
is emitted by individual cluster galaxies as well as intra- 
cluster stars. X-rays are emitted by both the hot intra- 
cluster medium (ICM) and AGN within cluster galaxies. 
This same hot ICM scatters microwave background pho- 
tons passing through the cluster, distorting their spec- 
trum. Finally, the total projected mass distribution of 
the cluster produces weak and sometimes strong lensing 
distortions in the images of background galaxies. Each 
of these observables presents an opportunity to detect 
clusters and measure their properties. When combined, 
they allow us to cross-check our understanding of cluster 
physics in a variety of ways. 

Clusters were first detected as early as the 18th cen- 
tury as anomalous groupings of similarly bright galax- 
ies (Biviano 2000). Optical surveys, which are relatively 
inexpensive, have long provided the largest cluster cata- 
logs, primarily because of their ability to detect objects 
with relatively low mass thresholds. While successful 
in identifying thousands of clusters (Abell 1958; Abell 
et al. 1989) and providing the first evidence for dark 
matter (Zwicky 1933, 1937), early optical detection was 
plagued by projection of galaxies along the line of sight 
(Collins et al. 1995). Precise CCD photometry has en- 
abled searches for galaxies clustered in space, brightness, 
and color (for a review of optical selection, see Gal 2006), 
substantially reducing the problems of projection. These 
more recent optical surveys also naturally provide accu- 
rate photometric redshifts. 

With the advent of X-ray satellites, detection of ther- 
mal emission from the hot ICM became possible (Gursky 
et al. 1971). Because X-ray emission depends on the 
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square of the density, this provides a higher contrast that 
is largely immune to projection effects. But when spa- 
tial resolution is low, contamination from non-thermal 
X-ray sources can be a difficult problem. These can be 
point sources like AGN, the non-thermal emission from a 
cooling core, or ongoing merger activity that has thrown 
a cluster out of thermal equilibrium. Over the last sev- 
eral decades, X-ray surveys have been used to assemble a 
large number of cluster catalogs (Gioia & Luppino 1994; 
Ebeling et al. 1996; Rosati et al. 1998; Romer et al. 2000; 
Bohringer et al. 2000; Mullis et al. 2003; Bohringer et al. 
2004; Burenin et al. 2006). Much of our understanding 
of cluster physics today derives from these X-ray selected 
catalogs. 

Substantial effort has gone into comparing optically 
and X-ray selected catalogs (Bahcall 1977; Edge & Stew- 
art 1991; Donahue et al. 2001, 2002; Mulchaey et al. 2003; 
Yee & Ellingson 2003; Gilbank et al. 2004; Popesso et al. 
2004), and to comparing optical and X-ray properties 
of clusters to their weak lensing and SZ signals (Squires 
et al. 1996; Allen 1998; Fischer 1999; Zaroubi et al. 2001; 
Dahlc et al. 2002; Cypriano et al. 2004; De Filippis et al. 
2005; Mahdavi et al. 2005; LaRoque et al. 2006; Hoekstra 
2007; Mahdavi et al. 2007; Bardeau et al. 2007). Com- 
prehensive comparisons of optical and X-ray properties 
of clusters have been hampered by the lack of large sam- 
ples that are uniformly observed in both passbands. 

In this paper we describe measurements of the X- 
ray properties of the largest publicly available optically- 
selected cluster sample: the maxBCG catalog (Koester 
et al. 2007a). This approximately volume-limited clus- 
ter catalog spans the redshift range from 0.1 < z < 0.3. 
Uniform optical photometry and relatively precise pho- 
tometric redshifts (A z < 0.015) for all these clusters are 
available from the same Sloan Digital Sky Survey (SDSS: 
York et al. 2000) data from which the clusters were se- 
lected. X-ray observations of all these maxBCG clusters 
are available from the ROSAT All-Sky Survey (RASS: 
Voges et al. 1999). While RASS exposures are too shal- 
low to allow significant individual detections of every 
maxBCG cluster, they provide precise measurements of 
the mean X-ray luminosity (Lx) as a function of optical 
richness and redshift. In addition, the low signal-to- noise 
measurements of X-ray emission from individual clusters 
can be used both to confirm the measurement of mean 
X-ray emission obtained by stacking and to provide esti- 
mates of the scatter in the optical richness-X-ray lumi- 
nosity relation. 

The maxBCG catalog has been studied in a variety 
of complementary ways. For example, both dynami- 
cal (Becker et al. 2007, henceforth B07) and weak lens- 
ing (Sheldon et al. 2007; Johnston et al. 2007, S07, J07) 
observations of these clusters have been extracted from 
SDSS data. These earlier observations can be combined 
with the mean Lx measurements presented here to pro- 
vide further insight into cluster physics. The Lx~<7 re- 
lation inferred for maxBCG clusters is described here, 
while the corresponding Lx~M 2 oo relation obtained from 
weak lensing is discussed in a companion letter (Rykoff 
et al. 2007). 

An analysis similar to that reported in this paper was 
performed for the NIR selected 2MASS cluster catalog by 
Dai, Kochanek, & Morgan (2007, henceforth DKM07). 
The 2MASS NIR flux-limited catalog has - 4000 nearby 



(z ~ 0.02) groups and clusters ranging in mass from 
<~ 10 13 — 10 15 M Q , selected with a matched-filter algo- 
rithm (Kochanek et al. 2003). DKM07 are the first to 
take a large optically-selected cluster catalog and mea- 
sure the mean (stacked) X-ray properties using RASS, 
rather than simply cross-correlating optically-selected 
and X-ray-selected clusters. They find that the X-ray 
luminosity of 2MASS clusters scales with optical rich- 
ness (their A» 6 66, the number of galaxies brighter than 
L* within <~ i?20o)- In addition, they derive X-ray tem- 
peratures and hydrostatic masses for the stacked 2MASS 
clusters. 

In Section 2, we briefly review the SDSS data, 
maxBCG catalog, and RASS data on which this study 
is based. Section 3 describes our measurement of X-ray 
luminosities for individual clusters, as well as our meth- 
ods for determining the mean X-ray luminosity of a set of 
clusters with similar richness. We describe the mean rela- 
tion of X-ray luminosity as a function of richness, scatter 
in this relation, and the underlying median Lx — A^oo 
relation in Section 4. Section 5 discusses several possible 
sources of systematic bias in these results. In Section 6 
we combine these results with dynamical measurements 
of maxBCG clusters to produce a measurement of the 
Lx — <5 relation. Conclusions and some discussion of fu- 
ture work are presented in Section 7. Throughout this 
work we use a ACDM cosmology with H = 100 h kms -1 
and Q m — 1 — Q\ = 0.3. 

2. INPUT DATA 

The measurements described here are based on two 
wide-area imaging surveys; SDSS and RASS. Galaxy 
clusters are selected from the SDSS five-band imaging 
data using a red sequence selection method. SDSS data 
also allow measurement of cluster redshifts and rich- 
nesses. X-ray emission from these clusters is then mea- 
sured by from the RASS photon maps. In this section 
we describe briefly the SDSS imaging data, galaxy cluster 
selection and calibration, and RASS input data. 

2.1. SDSS data 

Optical data for this study are drawn from Sloan Digi- 
tal Sky Survey 10 : a combined imaging and spectroscopic 
survey of 10 4 deg 2 in the North Galactic Cap and a 
smaller region in the South. The imaging survey was 
carried out in drift-scan mode in five SDSS filters (u, g, 
r, i, z) to a limiting magnitude of r < 22.5 (Fukugita 
et al. 1996; Gunn et al. 1998; Smith et al. 2002). Photo- 
metric errors are typically limited at bright magnitudes 
by systematic uncertainties at the 3% level. The spectro- 
scopic survey targets both a "main" sample of galaxies 
with r < 17.8 and a median redshift of z ~ 0.1 (Strauss 
et al. 2002) and a "luminous red galaxy" sample (Eiscn- 
stein et al. 2001) which is approximately volume limited 
out to z=0.38. For more details of the SDSS see York 
et al. (2000) and Adelman-McCarthy & et al. (2007). 

2.2. maxBCG Catalog 

The maxBCG cluster catalog is selected from imag- 
ing data contained in DR5 of the SDSS. Selection of 
galaxy clusters from this imaging data is done using the 

10 http://www.sdss.org 
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"maxBCG" algorithm. Details of the algorithm are pre- 
sented in Koester et al. (2007b), while the catalog and 
a description of its properties may be found in Koester 
et al. (2007a). In brief, the algorithm exploits two well- 
known features of rich galaxy clusters in addition to the 
tight spatial clustering of cluster galaxies. First, the 
bright end of the cluster luminosity function is dominated 
by red sequence galaxies with a small dispersion in color- 
magnitude space (the E/SO ridgclinc). Second, clusters 
contain a distinct brightest cluster galaxy (BCG) located 
near the center of the galaxy distribution. While some 
clusters lack an obvious central, dominant galaxy, every 
cluster does possess some red sequence galaxy brighter 
than any other. 

The algorithm measures independently the likelihood 
that a galaxy is spatially located in an overdensity of 
E/SO ridgclinc galaxies with similar g — r and r — i col- 
ors, and that it has the color and magnitude properties of 
a typical BCG. Both likelihoods are evaluated for every 
SDSS galaxy at a grid of redshifts. The redshift which 
maximizes the product of these likelihoods is then found 
for each galaxy. For E/SO galaxies, this corresponding 
maximum likelihood redshift then provides a good esti- 
mate of the cluster redshift. 

Once this list of cluster center likelihoods is assem- 
bled, these potential centers are ranked by decreasing 
maximum likelihood. The first cluster is seeded on the 
highest likelihood center, that cluster's BCG. Galaxies 
projected within a scaled radius, -R20O: of this BCG and 
within ±0.02 in z are eliminated from the list of po- 
tential centers. -R200 is the radius interior to which the 
mean density is 200 times the critical density (p cr it) as 
determined from SDSS galaxy populations (Hansen et al. 
2005). The process is repeated for the next most likely 
BCG on the list, given that it has not been eliminated 
by a higher likelihood BCG, and continues likewise until 
all potential centers have either been labeled as cluster 
BCGs or have been subsumed by higher likelihood cen- 
ters. Each cluster defined in this way has a center de- 
fined as the BCG location, an estimated redshift, and 
a richness, N 2 oo, given by the number of E/SO ridgeline 
members falling within -R200 of the BCG and brighter 
than 0.4 L». The final cluster catalog contains an array 
of measured properties, including photometric redshifts, 
richnesses, optical luminosities, and locations. 

The public maxBCG catalog contains a total of 13,823 
clusters drawn from approximately 7500 square degrees 
of sky between redshifts of 0.1 and 0.3, with a median 
redshift z = 0.23. The center for each cluster is defined 
as the location of the BCG identified by the algorithm. 
The richness of the cluster, N 2 oo, ranges between 10 and 
188 in the public catalog, and in principle extends down 
to -/V200 = 1 where the maxBCG selection function is less 
well-understood. For this study we include some clusters 
of lower richness, adding an additional 3532 clusters with 
A 2 oo = 9. This slightly extended catalog allows us to use 
the same richness bins studied in the analysis of maxBCG 
galaxy dynamics (B07) and gravitational lensing (S07, 
J07). 

Redshift estimates for the clusters, produced as part 
of the cluster finding process, have been shown by com- 
parison to spectroscopic redshifts to be quite accurate, 
with A z < 0.015 (Koester et al. 2007a). Catalog com- 
pleteness and purity have been studied in some detail in 



Rozo et al. (2007a); both are quite high. Completeness is 
estimated to be > 90% for clusters with masses greater 
than 10 14 h~ x M , and purity is > 90% for clusters with 
richnesses N200 > 10- 

The cluster population in this catalog has been used to 
derive constraints on cosmology (Rozo et al. 2007a) using 
cluster counts. The relationship between the maxBCG 
richness -/V200 and mass has been studied through galaxy 
dynamics (B07) and weak lensing (S07, J07). Further 
work on galaxy populations, mass-to- light ratios, and im- 
proved richness estimates for these clusters is in progress. 

2.3. ROSAT All-Sky Survey 

The ROSAT All-Sky Survey (RASS, Voges et al. 1999) 
took place primarily during a six-month campaign in 
1990-1991 to image the whole sky in soft X-rays (0.1-2.4 
keV) with the ROSAT Position Sensitive Proportional 
Counter (PSPC, Pfcffermann et al. 1987). The sur- 
vey scanned the sky in great circles, with the largest net 
exposure time (~ 40000 s) near the ecliptic poles. The 
typical field coincident with the maxBCG survey region, 
which does not overlap the northern ecliptic pole, has 
an effective exposure time of ~ 400 s. The point spread 
function (PSF) for these RASS scans is very broad (full- 
width-half-maximum of ~ 25") and is dominated by far 
off-axis photons due to the survey strategy. Voges et al. 
(2001) released reprocessed photon maps and exposure 
maps of the entire RASS survey region. These photon 
maps provide the input for the analysis described in this 
paper. 

RASS data has been used to create several catalogs 
of purely X-ray selected objects. The ROSAT bright 
source catalog (BSC, Voges et al. 1999) consists of 18,811 
sources with a typical signal-to-noise > 4. The position 
resolution is superior to the PSF FWHM with 68% (90%) 
of the sources within 13" (25"). In addition to the BSC, 
there is a companion Faint Source Catalog (FSC, Voges 
et al. 2000) consisting of 105,924 sources with a typical 
signal-to-noise > 2. Most are unresolved. Combined, 
these catalogs provide soft X-ray detected sources which 
can also be compared to the maxBCG clusters. 

The RASS photon data has also been used as an in- 
put for X-ray flux- limited cluster catalogs. The Bright- 
est Cluster Sample (BCS, Ebeling et al. 1998) is a flux- 
limited sample of the brightest 201 clusters in the north- 
ern hemisphere with fluxes Fx > 5 x 10~ 12 erg s _1 cm~ 2 . 
The Northern ROSAT All-Sky galaxy cluster sur- 
vey (NORAS, Bohringer et al. 2000) is a catalog of 378 
extended X-ray sources that have been confirmed to be 
clusters via optical follow-up. Due to the broad ROSAT 
PSF, this catalog is only ~ 50% complete at their stated 
flux limit, Fx > 3 x 10~ 12 erg s _1 cm~ 2 , which corre- 
sponds to a luminosity Lx > 1 x 10 44 h~ 2 crgss -1 at 
our median redshift z = 0.23. Koester et al. (2007a) 
have performed an initial comparison between NORAS 
clusters and maxBCG clusters and have found that the 
maxBCG detects > 90% of NORAS objects. This is 
consistent with estimates of the completeness from sim- 
ulations (Koester et al. 2007a; Rozo et al. 2007b). The 
ROSAT-ESO Flux Limited X-ray galaxy cluster sur- 
vey (REFLEX: Bohringer et al. 2004) is similar to NO- 
RAS in the southern sky, with the same flux limit. This 
catalog of 447 clusters is over 90% complete due to im- 
provements in RASS analysis, although only a small frac- 
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Table 1. Number of Clusters In Each Bin 



Richness Range a 


- 

-"200 


z 

(h,- 1 kpc) 


D b 
-K200 


"clust 


■"stack 


71 < Ar 20 o < 188 


92.85 


0.21 


1727 


55 


55 


51 < JV200 < 70 


58.22 


0.21 


1469 


146 


140 


41 < 7V 2 oo < 50 


44.67 


0.21 


1317 


207 


201 


33 < JV 2 oo < 40 


35.74 


0.21 


1201 


356 


339 


26 < N 20 o < 32 


28.57 


0.22 


1102 


665 


633 


21 < JV200 < 25 


22.70 


0.22 


997 


1128 


1060 


18 < JV 2 oo < 20 


18.91 


0.23 


941 


1141 


1099 


12 < N 200 < 17 


13.88 


0.23 


823 


5651 


5405 


9 < AT 200 < 11 


9.80 


0.23 


727 


7986 


7566 


random 






7986 


7529 





a A r 200 is the number of red-sequence galaxies brighter than 0.4 
L r within a scaled aperture -R200 • 

b -R200 is the radius internal to which the mean density is 200 times 
the critical density (Hansen ct al. 2005). 



tion of REFLEX overlaps the maxBCG survey area. 

3. X-RAY ANALYSIS 

The typical RASS exposure time for maxBCG clus- 
ters, 400 s, is too short to allow significant detections for 
individual clusters. The large number of maxBCG clus- 
ters, however, allows us to make up for this. For exam- 
ple, there are 7986 clusters with richnesses 9 < A^oo < 
11. For these objects, the total RASS exposure time 
is ~ 320,000 s. Such a large total exposure allows us 
to measure the mean X-ray emission from these clusters 
quite precisely. 

Details of the stacking method are described later in 
this section, and outlined here. We begin by dividing 
the clusters into nine richness (-/V200) bins. To simplify 
comparison to other maxBCG analyses, we use the same 
richness bins used in measurements of the mean veloc- 
ity dispersions (B07). The number of clusters in each 
richness bin is shown in Table 1. 

As we do not have X-ray centers for individual clusters, 
we treat the BCG selected by the maxBCG algorithm as 
the center of each cluster and stack on these centers. In 
Koester et al. (2007a) it was shown that most maxBCG- 
selected BCGs (<~ 80%) agree well with the center of X- 
ray selected clusters, which has also been seen for other 
optically-selected catalogs (e.g. Lin & Mohr 2004). Pos- 
sible biases introduced by this assumption are discussed 
in Section 5.2. Every source and background photon in 
the stacked analysis is scaled and weighted to the me- 
dian redshift of the clusters in the catalog, z = 0.2296. 
With these weighted photon maps we construct stacked 
images in Section 3.2, radial profiles in Section 3.3, and 
spectra and luminosities in Section 3.4. 

3.1. Cluster Extraction and Selection 

We obtain RASS photon data and merged expo- 
sure maps from the archives available at the High 
Energy Astrophysics Science Archive Research Center 
(HEASARC). The RASS data and exposure maps are 
distributed in arbitrarily constructed 6.4 x 6.4 degree 
fields with significant overlap. Nevertheless, since we 
utilize a large background annulus extending to 40' from 
the central BCG, a significant fraction (<~ 20%) of the 
maxBCG clusters fall across a field boundary. We have 



therefore built a tool that extracts all RASS photons in 
a given aperture, from multiple RASS fields if necessary, 
counting only once those photons which appear multiple 
times in overlap regions. 11 After photon retrieval, the 
appropriate merged exposure maps (mex file) are used to 
calculate the effective exposure time (U) at the position 
of each detected photon. 

Before stacking we exclude from the list a subset of 
maxBCG clusters which might bias our X-ray measure- 
ments for unphysical reasons. No cut on Galactic ab- 
sorption is required, as the SDSS observations are re- 
stricted to high galactic latitude. The typical equivalent 
Galactic hydrogen column density is a very low: Nh = 
2x 10 20 cm" 2 (Dickey & Lockman 1990) for the maxBCG 
clusters, with a maximum of Nh = 9 x 10 20 cm- 2 . Al- 
though the typical RASS exposure time is <~ 400 s, there 
are some fields with significantly less. This makes source 
and background estimation difficult, while not adding 
significantly to the signal. For this reason, we reject all 
clusters with less than 200 s mean exposure time. This 
removes 4% of the total number of clusters, but less than 
1% of the net exposure time. 

An important possible source of contamination for 
stacked cluster measurements is the presence of a few 
extremely bright foreground sources. We would like to 
reject regions contaminated by these sources. RASS im- 
ages of maxBCG clusters, especially at higher redshift, 
are often unresolved, making it difficult to remove point 
sources using extent information, so we proceed as fol- 
lows. We first identify those ROSAT Bright Source Cat- 
alog (BSC) sources in the survey area with count rates 
higher than that expected for emission from any clus- 
ter at redshift beyond 0.1. This corresponds to a count 
rate of ~ 2cts _1 . There are 179 BSC sources in the 
maxBCG survey area with ROSAT soft-band (0.1-0.5 
kcV) or hard-band (0.5-2.1 keV) count rates above this 
limit. Of these 179 sources, only 13 are clusters as iden- 
tified in the ROSAT Brightest Cluster Sample (BCS, 
Ebeling et al. 1998). Of these 13 clusters, only one of 
these objects is associated with a maxBCG cluster (Abell 
2142, the richest and one of the nearest clusters in the 
catalog). The other high- flux clusters are at redshifts 
z <C 0.1. Further visual inspection has confirmed that 
Abell 2142 is the only maxBCG cluster associated with 
one of these extremely bright sources. For the stacking 
analysis, we reject all maxBCG clusters (except for Abell 
2142) within 45' of any of these bright sources, due to 
possible contamination from non-cluster photons. This 
removes only 0.6% of the maxBCG catalog. 

The Virgo cluster is another important foreground 
source. This nearby, X-ray bright cluster dominates the 
RASS emission over a significant region of sky, thus mak- 
ing it difficult to estimate the X-ray emission from back- 
ground clusters. We therefore reject all clusters that fall 
within 1.5 degrees of M86 or M87. This is a relatively 
small effect, removing only 0.1% of the maxBCG clusters. 
Table 1 shows the number of clusters in each richness bin 
before and after the rejections outlined in this section, as 
well as the mean richness for each bin. 

To stack the X-ray measurements of the clusters, RASS 

11 RASS photons are tagged with the detector location, energy 
channel, and time of arrival, providing a unique description of each 
photon. 
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photons from each cluster must be scaled and appropri- 
ately weighted. The projected physical distance to the 
BCG is calculated for each photon, and this scaled dis- 
tance is used in image construction and radial profile cal- 
culations. Each source and background photon is then 
weighted by a factor of w(z) — {cIl{z) / cIl{z)) 2 , where 
cIl(z) is the luminosity distance to the cluster and cIl(z) 
is the luminosity distance to the median cluster redshift 
z = 0.2296. For background estimation we use a fixed 
annulus with inner (outer) radius of 20' (40'), similar 
to that used for the NORAS catalog for individual clus- 
ters (Bohringer et al. 2000). This background annulus 
corresponds to a physical distance of 1.5 — 3.1 /i -1 Mpc 
at our minimum redshift of 0.1 and a 3.8 — 7.5 h^ 1 Mpc 
at our maximum redshift of 0.3. We confirm that our re- 
sults are not sensitive to the exact choice of background 
annulus. To calculate the weighted count rate (C w ) in 
a given bin (image, radial, or spectral) for a cluster at 
redshift z, we follow a procedure similar to Bohringer 
et al. (2000), where C w = J2 i , w(zi)/ti, where U is the 
exposure time at the position of each detected photon. 

As a simple test of our stacking procedure, we also ex- 
tract RASS photons from a set of positions selected at 
random from the region covered by the maxBCG catalog. 
Each such random position is associated with a real clus- 
ter redshift drawn from the catalog in the 9 < A 2 oo < H 
bin. The random points are analyzed identically to the 
real cluster positions. 

3.2. Stacked X-Ray Images 

The background-subtracted stacked images are shown 
in Figure 1. Each image has a projected radius of 
2.0 h~ x Mpc. The images contain photons from the 
ROSAT hard band (channels 51-201, 0.5-2.0 kcV) as this 
band has the highest signal-to-noise for cluster emission. 
After background subtraction, the counts were put in 
100 x 100 hr 1 kpc bins. For display, each image has been 
scaled with histogram equalization to show similar back- 
ground noise levels, as there are two orders of magnitude 
more clusters in the poorest bin than the richest bin. The 
contours are drawn 3a and 5a above background level. 

Highly significant X-ray emission is seen in each stack 
of clusters. By contrast, in a stack of 7,566 random points 
there is no significant excess, and no sign of emission 
centered in the stack. It is worth noting that the average 
count rate in the 9 < A 2 oo < H bin is only 0.0025 cts -1 , 
corresponding to a flux Fx = 5 x 10~ 14 erg s _1 cm -2 . 
There are a total of <~ 8500 cts in excess of background 
from ~ 8000 clusters; an average of only 1 source count 
per cluster. This illustrates the power of the stacking 
exercise to probe very low count rates, and allows us to 
use RASS to measure very low Lx cluster emission even 
at a moderate redshift of <~ 0.2. 

3.3. Radial Profiles 

Figure 2 presents the background-subtracted surface- 
brightness profiles for the stacked X-ray images in each 
richness bin. The radial profiles have been summed in 
50 h^ 1 kpc (100 h^ 1 kpc) bins inside (outside) 1 h' 1 Mpc. 
For reference, the dotted lines represent the \a back- 
ground level. Significant cluster emission is seen in each 
richness bin. For the richest bins, this extends out 
beyond lh^ 1 Mpc. The X-ray surface brightness pro- 
files for the different bins look remarkably similar, ex- 



Table 2. (3 Model Parameters 



Richness Range 



P 



Rdh- 1 Mpc) x 2 /^ 



71 < A^oo < 188 
51 < AT 2 oo < 70 
41 < A^oo < 50 
33 < JV200 < 40 
26 < Af 2 oo < 32 
21 < Afaoo < 25 
18 < JV 20 o < 20 
12 < AT 20 o < 17 
9 < Af 20 o < 11 



0.67 ± 0.03 
0.55 ± 0.03 
0.62 ± 0.04 
0.64 ± 0.04 
0.57 ± 0.03 
0.75 ± 0.08 
0.51 ±0.03 
0.57 ± 0.03 
0.54 ±0.04 



0.31 
0.23 
0.28 
0.31 
0.24 
0.40 
0.21 
0.24 
0.22 



±0.03 
±0.02 
±0.03 
±0.03 
±0.03 
±0.05 
±0.03 
±0.03 
±0.04 



37.7/72 
43.2/72 
36.2/72 
75.1/72 
46.3/72 
88.1/72 
38.9/72 
36.2/72 
63.4/72 



cept for the normalization which increases strongly with 
richness and the signal-to-noise which decreases slowly 
with richness. We fit the surface brightness profiles 
out to 1.5 ft. -1 Mpc with a standard (3 model, S(R) = 
S Q (1 + R 2 /R 2 c )- ifi+1 / 2 . The best-fit parameters are pre- 
sented in Table 2, and are overplotted with dashed lines 
in Figure 2. The (3 model results in a good fit in all cases. 

The core radii, R c , for these (3 model fits are sur- 
prisingly large, ranging from ~ 200 /i -1 kpc to ~ 
400 ft- -1 kpc. These are significantly larger than those 
typically seen for X-ray clusters (~ 50 — 150 h^ 1 kpc, e.g. 
Neumann & Arnaud 1999). The (3 parameters are typ- 
ical for X-ray clusters (e.g. Neumann & Arnaud 1999; 
Sanderson et al. 2003). There is a slightly significant 
(~ 3er) trend in (3 with richness, such that the richer 
clusters tend to have slightly larger values of (3. This 
slight trend is in the same sense as has been seen in pre- 
vious work correlating (3 with cluster mass (Horner et al. 
1999; Helsdon & Ponman 2000; Sanderson et al. 2003). 
These /3-model parameters are similar to those measured 
by DKM07, in which f3 was not seen to be strongly corre- 
lated with richness. These inconclusive results are caused 
by two effects; the broad point spread function for RASS 
objects, and the offset distribution between the BCG and 
the X-ray cluster emission. 

3.3.1. RASS PSF effects 

To study the effect of the RASS PSF on the stacked 
profiles, we test our analysis on bright ROSAT point 
sources. These sources were selected from the WGA 
catalog (White et al. 2000), which contains 88,621 well 
measured point-like sources selected from ROSAT PSPC 
pointed observations. We take ~ 500 moderately bright 
(0.1-0.5 cts -1 ) point sources that overlap the SDSS DR5 
mask. These bright sources are nearby Galactic sources 
as well as AGN and quasars at moderate and high red- 
shift, and are also well detected in RASS. Each point 
source is randomly assigned a redshift drawn from the 
maxBCG cluster catalog redshift distribution. We note 
that a nearby unresolved point source is indistinguish- 
able from a distant unresolved point source. The point 
sources are then stacked in exactly the manner described 
for clusters above. Figure 3 shows the radial profile of the 
stacked point sources (diamonds), which is essentially a 
measure of the RASS PSF with the radial scaling cal- 
culated for the maxBCG cluster redshifts. While this 
radial profile is much more sharply peaked than that of 
the stacked clusters, the broad ROSAT PSF scatters sig- 
nificant X-ray emission to distances > 1ft, -1 Mpc (7' at 
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random 



Fig. 1. — Stacked X-ray images for the 9 richness bins and random points. Each image has a projected radius of 2.0/i~ 1 Mpc. The 
ROSAT hard band (0.5-2.0 keV) counts have been background subtracted and put in 100 X 100 h^ 1 kpc bins. For display, each image has 
been scaled with histogram equalization to show similar background noise levels. The contours are drawn 3cr and 5<r above background 
level. 



z = 0.2) from the center. This PSF smearing implies an 
effective minimum on the /3-model core radius, as well 
as creating a possible bias in the calculation of the [3 
parameter. We note that < 10% of the selected WGA 
point sources are within 10' of maxBCG clusters. Ex- 
cluding the point sources that are neighboring these pos- 
sibly extended X-ray sources does not significantly alter 
this result. 

3.3.2. Optical / X-ray offsets 



The stacked, projected surface brightness profile may 
also be affected by our choice of cluster centers. There are 
several ways in which the BCG chosen by the maxBCG 
algorithm might be offset from the X-ray emission from 
the cluster. Dynamically active clusters and clusters in 
the process of merging can have a significant offset be- 
tween the X-ray centroid and the BCG. There is also the 
possibility that the maxBCG algorithm chose an incor- 
rect center (this is addressed more fully in Section 5.2). 
In addition, there may be additional X-ray point sources 
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Fig. 2. — The background-subtracted surface-brightness profiles for the stacked images in each richness bin. The radial profiles have been 
summed in 50 h~ kpc (100/i _1 kpc) bins inside (outside) lh~ 1 Mpc. For reference, the dotted lines represent the la background limit. 
Significant cluster emission is seen in each richness bin. For the richest bins, this extends out beyond 1 h~ 1 Mpc. The dashed lines show 
the /3 model fits from Table 2. 




0.6 0.8 1.0 
Radius (h ' Mpc) 

Fig. 3. — The radial profile of stacked point sources selected from 
the WGA (White et al. 2000) catalog. Each point source has been 
randomly assigned a rcdshift from the maxBCG rcdshift distribu- 
tion to simulate the effect of smearing due to radial scaling. The 
diamonds show the stacked point source profile with the centers as 
determined in WGA; there is significant emission out to distances 
> 1 h^ 1 Mpc (7' at z = 0.22). The circles show the stacked point 
source profile after convolving with the estimated maxBCG/BSC- 
FSC offset distribution. For reference, the dotted line represents 
the lcr background limit. The outer profile is well-fit by a /3-model 
(dashed line) with /3 = 0.61 ± 0.01. Thus, a cluster-like radial 
profile is obtainable in stacked images even without true extended 
cluster emission. 



such as AGN that are associated with the clusters. All 
of these effects result in an effective optical/X-ray offset 
distribution that may bias the radial profiles. 

We model the optical/X-ray offset distribution by 
matching the maxBCG catalog to known X-ray sources. 
For this exercise, we use the fact that many maxBCG 
clusters (> 900) are associated with individual detections 
in the ROSAT Bright Source and Faint Source Catalogs. 
Most of these BSC and FSC sources have not been pre- 
viously recognized as associated with clusters, primarily 
because they are too faint or too distant to be seen as 
significantly extended in RASS. We match the maxBCG 
clusters to the BSC and FSC catalogs, allowing multiple 
X-ray sources to match to each cluster. This ensures that 
we find all X-ray sources associated with each cluster, as 
well as all possible random matches. We similarly match 
an equal number of random locations drawn from the 
maxBCG survey region to the BSC and FSC catalogs. 
Figure 4 shows the projected offset distribution from the 
maxBCG clusters to the BSC and FSC catalogs. The 
solid (dashed) line shows a histogram of maxBCG (ran- 
dom) offsets in 50 /i -1 kpc bins. The dotted line shows 
the residual distribution after subtracting the random 
matches from the cluster matches. 

The subtracted histogram in Figure 4 shows a large 
excess of X-ray sources associated with the optical clus- 
ter centers. There is a tight core in which the BCG 
is within ~ 150/i _1 kpc of an X-ray source, as well as 
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Fig. 4. — Histogram of offset distribution between maxBCG 
clusters and X-ray sources detected in the BSC and FSC catalogs. 
Each cluster is allowed to match to multiple X-ray sources, ensur- 
ing we find all associated X-ray sources in addition to all random 
matches. The solid (dashed) line shows the histogram of maxBCG 
(random) offsets in 50 h^ 1 kpc bins. The dotted line shows the re- 
sult of subtracting the random matches from the cluster matches. 
There is a tight core in which the BCG is within ~ f 50 kpc of 
an X-ray source, as well as a long tail. 



a long tail extending out to ~ 1500 h" 1 kpc. The X- 
ray source excess at large radius is likely to be associ- 
ated with the maxBCG clusters. These sources comprise 
a mix of merging clusters, clusters with poorly identi- 
fied centers, and associated point sources such as AGN. 
The subtracted histogram is used as a first-order empir- 
ical radial distribution of X-ray sources associated with 
maxBCG cluster centers. 

We now estimate the additional effect on the radial pro- 
file due to the radial offset distribution of X-ray sources 
associated with the maxBCG clusters. We take each 
WGA point source from Section 3.3.1 and randomly alter 
its position so that the distribution is offsets is identical 
to the empirical radial distribution. Although we have 
only measured the offset distribution for the brightest 
~ 5% of maxBCG clusters, we assume this is a good 
proxy for the offset distribution between the X-ray emis- 
sion and BCGs of all the maxBCG clusters. These offset 
point sources are run through our stacking analysis. Fig- 
ure 3 shows the radial profile of the offset point sources 
after stacking (circles). This radial profile is reasonably 
well fit with a (3 model, although the fit is poor within 
<~ 200 hr x kpc. At large radii the radial profile is well fit 
with (3 = 0.61±0.01. Thus, at large radii it is not pos- 
sible to distinguish between true extended cluster emis- 
sion and X-ray point sources convolved with the observed 
maxBCG-X-ray source offset distribution. 

This calls into question the utility of the (3 model pa- 
rameterization for this exercise. The stacked X-ray pro- 
file is a convolution of the ROSAT PSF, the centering 
distribution, and the true extended X-ray emission. Op- 
erationally, it is not possible to separate the contributions 
from these three components. Therefore, the (3 parame- 
ters that are the result of a stacking exercise such as this 
one, where most of the individual X-ray clusters are not 
detected, should be used cautiously. 



3.4. Stacked X-ray Spectra 

We perform spectral fitting of the photons in each stack 
to calculate X-ray luminosities. The spectra were ana- 
lyzed using XSPEC version 11.3.2 (Arnaud 1996). Spec- 
tral analysis on the stacked ROSAT data involves a va- 
riety of complications, some of which are discussed in 
DKM07. We outline some key issues here. 

All RASS observations integrate exposure time across 
the ROSAT field-of-view. As a result, we use the spectral 
response file pspcc_gainl_256 . rmf from HEASARC, 
suitable for off-axis observations. When combined with 
the appropriate ancillary response file (arf ), we obtain 
the vignetting corrected spectral response for the off-axis 
photons that comprise our spectra. However, the ex- 
posure times extracted from the merged exposure maps 
for RASS are already corrected for vignetting, using the 
mean spectrum of the X-ray background in the PSPC 
band 12 . In practice, this means that our spectra, fit 
with the standard rmf+arf combination, are corrected 
for vignetting twice - once by using the exposure times 
obtained from the merged exposure map, and once in 
the calculation of the arf file. By comparing the on- 
axis response file rsp from HEASARC with the com- 
bination of the rmf+arf we can approximate the extra 
vignetting correction. This correction is a function of 
energy, and depends (weakly) on the spectrum of the 
observed source. By simulating cluster spectra of var- 
ious temperatures, we calculate the typical correction 
factor averaged across the ROSAT hard band (0.5-2.0 
keV) which is dominated by cluster flux. This correc- 
tion factor is <~ 1.53. Using this correction factor results 
in very good agreement between our spectral analysis 
and the REFLEX count-rate to flux conversion tables 
of Bohringer et al. (2004). We estimate that applying 
this correction factor to all flux and luminosity values 
determined from stacked spectra adds an additional sys- 
tematic error of ~ 10%. 

For our analysis, we use the rmf file described above, 
combined with an arf output with the FTOOL pcarf . 
As noted in DKM07, each arf file is essentially iden- 
tical, because each stacked cluster from RASS samples 
photons from the entire ROSAT field of view. Spectral 
files are then grouped with a minimum of 50 counts per 
bin (after background subtraction) to ensure valid re- 
sults using x 2 statistical analysis. Fits performed with 
XSPEC were restricted to the 0.1-2.1 keV range. The 
uncertainties in spectral fit parameters are 90% confi- 
dence errors, obtained by allowing all fit parameters to 
vary simultaneously. 

We obtain cluster spectra by summing all weighted 
cluster photons in both fixed physical apertures and 
scaled apertures of the optically determined i?200 (see 
Section 2.2). The fixed aperture of 750 h~ x kpc is chosen 
as a reasonable fiducial value because this provides good 
signal-to-noise, and because the radial profiles do not 
appear to change significantly with richness. The -R200 
value for each bin was taken as the median i?2oo of all the 
clusters in the richness bin; these values are reported in 
Table 3. Background spectra are stacked with the same 
weights as the source spectra, using the annuli defined 
in Section 3.1. As described previously, each photon is 

12 see the manual page for the FTOOL pcexpmap 
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weighted to the median rcdshift z = 0.2296. Unfortu- 
nately, fc-correction of individual photons is not possible, 
due to the fact that the detection channel of the inci- 
dent photons is most strongly dominated by the spectral 
response of the ROSAT instrument. The stacking pro- 
cedure will therefore tend to "smear out" the incident 
cluster spectra. Simple simulations of X-ray spectra us- 
ing XSPEC show that this does not create a large effect 
on our best-fit spectral values, as our redshift range of 
0.1 < z < 0.3 is not particularly broad. Similarly, we 
do not make any corrections for possible redshift evolu- 
tion in X-ray luminosity; we will address this further in 
Section 5.3. 

The spectra are fit with an absorbed thermal plasma 
model (Raymond & Smith 1977). The mctallicity is fixed 
at 0.3 solar (e.g. Anders & Grevesse 1989), and the red- 
shift is set to the median scaled redshift of 0.2296. The 
luminosities are calculated in the rest-frame 0.1-2.4 keV 
band at the median redshift. We prefer to calculate the 
0.1-2.4 keV luminosity rather than bolometric luminos- 
ity (Lx,boio), due to the large extrapolations required 
to obtain bolometric values, which strongly depend on 
spectral temperatures (see below). The best-fit spectral 
parameters for the scaled i?200 apertures are shown in 
Table 3, and the parameters for the fixed 750/i~ 1 kpc 
apertures are shown in Table 4. 

In order to account properly for the variations in clus- 
ter luminosity in each richness bin, we use bootstrap re- 
sampling to estimate the luminosity errors. In each rich- 
ness bin, we run 2000 trials by sampling the same number 
of clusters in that bin with replacement. To save consid- 
erable time with processing, we did not recreate the en- 
tire stacking procedure for each resampling. Instead, we 
take the individual cluster counts, scaled to the median 
rcdshift. We confirm that the average scaled count rate 
is a good proxy for luminosity in the 0.1-2.4 keV band 
(see Section 4.1 for details on this calculation). The 68% 
(lcr) confidence interval obtained from the bootstrap is 
added in quadrature with the Lx errors obtained from 
the spectral fits. Only the richest (71 < A 2 oo < 188) bin 
is dominated by the bootstrap error calculation, due to 
the large range in richness and Lx in the bin. 

Figure 5 shows the mean Lx~Tx relation for our 
stacked clusters (within the scaled i?200 aperture), com- 
pared to the Markevitch (1998) L x -T x (0.1-2.4 keV) re- 
lation. We see the cluster temperature increase with lu- 
minosity, but our observed Lx~Tx relation is noticeably 
steeper than that of Markevitch (1998). The stacked X- 
ray temperatures appear to jinderestimate the expected 
temperature, especially at Lx larger than a few times 
10 43 h~ 2 ergss -1 . This discrepancy highlights the chal- 
lenge of measuring cluster temperatures with ROSAT, as 
well as the challenges of measuring stacked cluster tem- 
peratures. First, ROSAT has sensitivity only to soft X- 
rays. When the break of the bremsstrahlung spectrum, 
determined by the temperature of the hot gas, is above 
<~ 2.0 keV, then the X-ray temperature becomes diffi- 
cult to constrain. The ROSAT temperatures of hot clus- 
ters are generally underestimated; a particular example is 
Abell 1689. This bright cluster has a nearly isothermal 
profile in XMM/Newton observations with a tempera- 
ture of ~ 9 keV (consistent with the ASCA and Chandra 
values), but has a best-fit i?OS^4T/PSPC temperature 




Fig. 5. — Mean Lx~Tx relation for the stacked clusters, with the 
fits within R200- The dashed line shows the Markevitch (1998) re- 
lationship for "uncorrected" 0.1-2.4 keV luminosity. The observed 
relation is much steeper than that derived by Markevitch (1998). 
This is due to the fact that ROSAT temperatures tend to underes- 
timate cluster temperatures with Tx > 3 keV, as well as the com- 
plications in stacking many clusters with different temperatures. 



of 4.3to;gkeV (e.g. Andersson & Madejski 2004). Sec- 
ond, when we stack many non-isothermal clusters with 
different temperatures, our isothermal Raymond-Smith 
spectrum is no longer appropriate. For example, Ra- 
sia et al. (2005) have shown for Chandra observations 
how the spectroscopically-weighted temperature can dif- 
fer significantly from the emission- weighted temperature 
for single clusters, with the better measured colder gas 
dominating the spectral fit. When these effects are com- 
bined, interpreting the average best-fit spectral temper- 
ature from a stack of hundreds of clusters is non-trivial. 
_ Most importantly for our task of measuring the mean 
L x , the ROSAT luminosity in the 0.1-2.4 keV band is 
virtually insensitive to the cluster temperature. If we fix 
the spectral temperature to the value predicted by the 
Markevitch (1998) Lx~Tx relation, then Lx changes by 
< 5% in each bin, with no systematic bias with richness. 
Similarly, we can follow the prescription of Bohringer 
et al. (2004) to convert the mean 0.5-2.0 keV RASS count 
rate to flux and luminosity. This method assumes the 
Markevitch (1998) Lx-Tx relation, and requires itera- 
tively calculating the flux and temperature. The Lx 
values thus obtained are consistent with those from the 
spectral fitting to within < 3%. All this suggests that cal- 
culating 0.1-2.4 keV L x obtained from ROSAT — which 
relies primarily on photon counting — is more robust than 
calculating Tx and extrapolating to Lj^boio- Although 
a small fc-correction (~ 10%) is required to extrapolate 
from observer frame 0.1-2.4 keV to rest-frame luminos- 
ity, this correction is not very sensitive to the spectral 
temperature at the moderate redshift of the maxBCG 
clusters. 

4. MEAN L x -N 2 oo RELATION 

We stack the clusters and calculate the 0.1-2.4 keV 
X-ray luminosity as described in the previous section. 
Figure 6 shows the mean Lx _ ^200 relation, both for a 
fixed 750 h^ 1 kpc aperture and for the scaled i?2oo aper- 
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Table 3. Spectral Fits within ifeoo 



Richness Range 


N200 


R200 


N H 


kT 


Lx 


X 2 /» 






(h- 1 Mpc) 


(10 20 cm~ 2 ) 


(keV) 


(10 42 /i- 2 ergss- 1 ) 




71 < JV200 < 188 


92.85 


1.73 


2.5 ±0.4 


3 3+1.4 


228 ± 33 


47.2/76 


51 < ATonn < 70 


58.22 


1.47 


1.5 ± 0.3 


J - -1.1 


80.6 ± 6.9 


61.5/77 


41 < iV 2 oo < 50 


44.67 


1.32 


9 s+°- 7 

^•8-0.6 


z -°-0.6 


58.0 ± 7.3 


87.4/65 


33 < N 200 < 40 


35.74 


1.20 


9 + O.6 


2 +0 ' 6 


47.7 ±4.8 


100.6/95 


26 < N 200 < 32 


29.57 


1.10 


3 4 +0.9 


1 7 +0.4 
L - 1 -0.3 


28.4 ±3.0 


62.9/95 


21 < JV200 < 25 


22.70 


0.99 


9+O.9 
° -0.6 


1 6 +0 ' 4 
1 -°-0.2 


18.5 ± 2.1 


113.2/97 


18 < JV200 < 20 


18.91 


0.94 


1 5+°' 4 


1 6 +0 ' 4 
1 -°-0.2 


12.5 ± 1.1 


77.3/88 


12 < N 20 q < 17 


13.88 


0.82 


2 +0 ' 4 


1 4+03 
1 -0.1 


6.84 ± 0.47 


182.3/134 


9 < JV200 < 11 


9.80 


0.73 


2 +0 ' 7 


, 1+O.O8 
1 ' 1 -0.06 


3.30 ± 0.39 


127.8/112 



Table 4. Spectral Fits within 750/i _1 kpc 



Richness Range 


^200 


R200 


N H 


kT 


Lx 


X 2 /" 






(h- 1 Mpc) 


(10 2() cm- 2 ) 


(kcV) 


(10 42 /i- 2 ergss- 1 ) 


71 < JV200 < 188 


92.85 


1.73 


1.8 ±0.3 


, 1+1.1 
°- 1 -0.7 


167 ± 24 


40.5/61 


51 < iV 20 o < 70 


58.22 


1.47 


1.6 ±0.3 


3.8+ 2 ' 2 


57.3 ±4.9 


40.1/59 


41 < iV 2 oo < 50 


44.67 


1.32 


2.2 ±0.4 


2 6+ l!l 


45.6 ±5.6 


85.6/61 


33 < JV 2 oo < 40 


35.74 


1.20 


2.3 ±0.4 


9 c+0.8 
z -°-0.5 


38.8 ±3.9 


92.0/81 


26 < N20Q < 32 


28.57 


1.10 


3.0 ±0.6 


1 R+0' 5 
1 -°-0.2 


23.4 ±2.3 


78.1/84 


21 < iV 20 o < 25 


22.70 


0.99 


2.6 ±0.5 


I 7+O.4 

II -0.2 


16.4 ± 1.8 


86.0/94 


18 < N200 < 20 


18.91 


0.94 


2.0 ±0.5 


1 5 +0 ' 4 
L -°-0.2 


10.9/pm0.9 


73.6/76 


12 < jV 2 oo < 17 


13.88 


0.82 


2.0 ±0.3 


1 4+0-3 


6.50 ±0.45 


182.3/134 


9 < JV 20 o < 11 


9.80 


0.73 


2.1 ±0.7 


1 1+ - 2 
i - ± -0.06 


3.40 ±0.42 


137.6/114 



tures. We find a strong correlation between Lx and 7V 2 oo 
with both the fixed and the scaled apertures. In the low 
richness bins (iV2oo < 17), the scaled aperture i?2oo is 
approximately equal to the fixed 750 ft. -1 kpc aperture, 
and Lx is approximately the same for both choices of 
aperture. In the high richness bins, the scaled aperture 
-R200 is significantly larger than 750 ft -1 kpc, yielding a 
slightly larger_Ljf for the scaled apertures. For this rea- 
son, the Lx~N 2 oo relation is slightly steeper using scaled 
apertures. The best-fit mean relationships are: 



L x {< i?20o) = e 



3.87+0.04 



N- 



200 



40 



1.82+0.05 



10 42 /i^ergss" 1 



(1) 



o750 h" 1 kpc aperture 
□ R 200 aperture 



3.63+0.04 



N< 



200 



1.64+0.05 



10 h crgss 



Lx(< 750/ i - 1 kpc) = e ; 

(2) 

Power law fits are chosen to pivot around -/V200 = 40 
to approximately decouple errors in slope and normal- 
ization. 

DKM07 also found that the mean X-ray luminosity of 
the 2MASS clusters scales with optical richness (iV*666)- 
A direct comparison between the DKM07 relation and 
our relation is challenging, due to the different luminos- 
ity and richness definitions we employ, but a first-order 
conversion is possible. First, we convert the DKM07 
bolometric luminosities (their Table 2) to ROSAT 0.1- 
2.4 keV luminosities using Table 5 of Bohringer et al. 



n 2 oo 



Fig. 6. — Mean Lx-A^oo relation for maxBCG clusters, for 
the fixed 750 /i -1 kpc apertures (diamonds) and the scaled -R200 
apertures (squares) . There is a strong correlation between Lx and 
JV200, with a slope of 1.64±0.05 (750 h^ 1 kpc) or 1.82±0.05 (R 2 oo)- 
The scaled aperture relation is steeper because the aperture scales 
with JV200- 



(2004) 13 Re- fitting for the L x -N* 6e6 relation, we find 

13 This does not introduce any additional error, as DKM07 have 
published the precise spectral temperature they used to calculate 

■kx.bolo- 
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L x oc (JV.aae) 1 - 51 ^- 07 . 14 As the 2MASS catalog is not 
yet public, we cannot make a direct comparison of A^oo 
to A* 666 for individual clusters. Futhermore, most of the 
clusters in the 2MASS catalog are at z < 0.1, while the 
maxBCG catalog covers 0.1 < z < 0.3. At this time, the 
most we can say is that the slope of the Lx^richness re- 
lation is roughly similar between our work and DKM07. 

4.1. Scatter in the L x -N 2 ao Relation 

Comparisons to X-ray selected catalogs (see Sec- 
tion 4.2) as well as prior studies (e.g. Smail et al. 1998; 
Gilbank et al. 2004) lead us to expect significant scatter 
in the X-ray luminosity-richness relation for individual 
clusters. With a large scatter, the mean statistic used 
to calculate the stacked L x -N 2 qq relation may be signif- 
icantly biased relative to the median relation. To under- 
stand this bias, we first assume that there is a log-normal 
conditional probability p for the X-ray luminosity of a 
cluster at richness N 2 no- 



p(L x \N 200 ) = 



2na\ n , 



■ exp 



-{\nL x -\nL x y 



, (3) 



with constant intrinsic scatter ai n L, and mean log lumi- 
nosity that follows: 



lnL x (A 200 ) - A + Bln(A 200 /40), 



(4) 



where A is the log normalization of L x at A 2 oo = 40, 
and B is the slope of the L x -N 2 oo relation. Note that 
cxp(lnix) for a log- normal distribution is equivalent to 
the median (and geometric mean) of the distribution. 
For the duration of this paper, we employ the notation 

L x = cxp(lnLx)- 
At a given richness, we wish to measure the median 

X-ray luminosity L x , the peak of the underlying log- 
normal distribution. However, the stacking exercise we 
have undertaken is fundamentally a calculation of the 
arithmetic mean (L x ) at a given N 2 oo- For a log- normal 
distribution with median L x and intrinsic scatter <7i n L, 
the arithmetic mean is exp(lnix + af nL /2). Thus, the 
stacked normalization is an overestimate of the median 
of the underlying distribution by a factor of exp(of ni /2). 
If the scatter is large, the stacked (mean) normalization 
will be dominated by the most luminous clusters and will 
be biased high. For example, an 80% scatter indicates a 
- 30% bias. 

To constrain this scatter, we begin with measurements 
of X-ray flux (and hence L x ) at the locations of each 
cluster with A200 > 30. Though many of these are low 
signal-to-noise detections, they can be used to measure 
scatter. The IDL Astronomy library tool linmix_err is 
used to fit L x as a function of A 20 o with intrinsic scatter 
cini (Kelly 2007). This tool uses a Bayesian approach 
to linear regression with errors in X and Y and is well 
behaved even when the measurement errors dominate. It 
also handles non-detections and upper limits in Y. Monte 
Carlo simulations show that the selected A 2 oo cut pro- 
vides a large enough richness range to constrain the slope 
and scatter, and provides larger signal-to-noise than the 
entire cluster catalog, as over 80% of the 955 clusters with 

14 We note that the soft band Lx^^*H6H relation does not de- 
viate from a power-law at any richness, unlike the Lx bolo _ ^*666 
relation in Figure 10 of DKM07. 



A 2 oo > 30 are detected at at least the la level. Further- 
more, the fits are not sensitive to the precise richness 
cutoff chosen. 

To calculate the L x for each of these often marginally- 
detected clusters, we use a procedure based on the count- 
rate to flux conversion method from the REFLEX sur- 
vey (Bohringer et al. 2004). First, we calculate the 
0.5-2.0 keV (ROSAT hard channel) count rate in a 
750ft. _1 kpc aperture; the fixed aperture was used be- 
cause it results in better signal-to-noise than a larger 
scaled aperture (where we only see significant signal in 
the stacked profiles) . The local background is calculated 
in a 20' — 40' annulus using the sector-rejection method 
of Bohringer et al. (2000). 

In REFLEX the temperature is estimated in an iter- 
ative fashion from the luminosity, using the L x -T x re- 
lation of Markevitch (1998). As most of the maxBCG 
clusters do not have a significant flux (or luminosity) 
measurement, we cannot calculate the individual cluster 
temperatures in this way. Instead, each cluster tempera- 
ture is approximated by the stacked spectral temperature 
from the appropriate richness bin, as shown in Table 4. 
This ensures that our stacked cluster luminosities and in- 
dividual luminosity estimates are on approximately the 
same footing. Perhaps more importantly, changing the 
temperature does not change the 0.1-2.4 keV luminosi- 
ties significantly (e.g. Mohr et al. 1999). For example, 
for the clusters with > la detections, if we follow the 
REFLEX iterative recipe to convert count rate to L x , 
rather than fixing the temperatures at the stacked val- 
ues, the individual L x values change by < 3%. This is 
simply another example of how measuring temperatures 
with ROSAT is challenging. 

The 0.5-2.0 keV count rate is then converted to 0.1-2.4 
keV luminosity using Table 2 from Bohringer et al. (2004) 
with the equivalent hydrogen column density at the po- 
sition of the cluster (Dickey & Lockman 1990). After 
converting to luminosity, a fc-correction is applied using 
Equation 4 from Stanek et al. (2006), which is a good 
approximation of Table 3 in Bohringer et al. (2004). 15 
The /c-corrections are not very large, at most 10%. 

We compared our method of calculating L x for indi- 
vidual clusters to the L x values obtained in the NORAS 
catalog (see Section 4.2 for details on the X-ray catalog 
matching). For the matched clusters the values are all 
consistent within errors with <~ 10% scatter, and a sys- 
tematic offset of < 5%. The primary difference in our 
calculations is that the NORAS fluxes were calculated in 
an aperture obtained via growth curve analysis (GCA) 
designed to obtain the best signal-to-noise for each clus- 
ter, while we use a fixed physical aperture. Therefore, 
our fixed temperature and aperture provide an unbiased 
estimate of the cluster flux and luminosity even without 
a priori knowledge of extended cluster emission. Indi- 
vidual cluster L x and A 20 o values are shown in Figure 
7. 

We use linmix_err to estimate the power-law slope, 
normalization, and intrinsic scatter of the underlying dis- 
tribution of the L x -N 200 relationship for the 955 richest 

15 Note that there is a typographical error in Equation 4 of 
Stanek et al. (2006), so that it should read k(z,T) = {1 + [1 + 
log 10 (T/5k C V)}zy/ 2 . 




Fig. 7. — Lx vs. N200 for individual clusters with -/V200 > 30. The solid circles represent detections at the la level, and empty circles 
represent la upper limits. The typical error bar is plotted on the fictitious data point in the lower-right corner. Contours showing the ±lc 
contours on the best-fit median relation are shown in dark gray. The dashed (dotted) lines show the ±lai nL (±2ai nL ) scatter constraints. 
The median relation as constrained by linmix_err has been converted to the equivalent mean relation by multiplying the normalization by 
exp((T| nL /2), and is shown in light gray. The mean relation agrees well with the individual stacked bins (solid squares), which are about 
~ 30% brighter than the underlying median values. The empty squares denote "Abell X-Ray Undcrluminous" (AXU) clusters (Popesso 
et al. 2007), which have X-ray luminosities consistent with the observed scatter. 



clusters with 7V 2 oo > 30. The best fit relation is: 

/N vl.61i0.13 
L X (< 750 h- 1 kpc) = e 3-40±0.04 ( ^00 \ 1Q 42 ^-2 g 

. ^ 

with an intrinsic scatter a\ n L — 0.86 ±0.03. The individ- 
ual Lx values are calculated within a fixed 750 /i -1 kpc 
aperture, and thus this is to be compared to Equation 2. 
We test the robustness of this result by both splitting 
the input data into multiple independently fit subsam- 
ples and changing the richness threshold slightly. We 
find that the constraint on the scatter is robust, and the 
error bar is accurate. Possible systematic biases in the 
constraint on a\ n L are discussed in Section 5.6. 

The scatter-corrected Lx-N2qo relation is shown in 
Figure 7. The luminosities for the individual la detec- 
tions are plotted as solid circles, and the la upper lim- 
its are plotted as empty circles. The typical error bar 
for detections is shown in the lower-right corner. Con- 
tours showing the ±lcr contours on the best-fit median 
relation are shown in dark gray, and the dashed (dotted) 
lines show the ±lai n L (±2cri n L) scatter constraints. The 
median relation as constrained by linmix_err has been 
converted to the equivalent mean relation by multiplying 
the normalization by exp(cTj 2 n L /2), and is shown in light 
gray. The mean relation agrees well with the individual 



stacked bins (squares), which are about ~ 30% brighter 
than the underlying median values. 

We must emphasize that we are measuring the observed 
scatter in the Lx-N 2 oo relation as is appropriate to our 
catalog and stacking method. This scatter comes about 
not only due to intrinsic Lx variations between the clus- 
ters (including different morphologies and merging clus- 
ters), but is also due to point sources, cooling flows, and 
the projection of multiple clusters along the line of sight. 
For the ~ 1000 richest clusters (N200 > 30) used in this 
exercise, the chance of projection is very low. We do 
not have any way to remove point sources or bright cool 
cores from individual clusters in an unbiased way. Oper- 
ationally, it is not relevant to our measurement whether 
the scatter is due to intrinsic Lx variations or due to 
point source and cool-core contamination. To constrain 
the observed scatter, we need only assume that the un- 
derlying distribution is approximately log-normal, which 
is consistent with our observations. 

Gilbank et al. (2004) measure the observed scatter in 
the Lx~B gc relation, where B gc is a richness measure 
from the amplitude of galaxy-cluster correlation func- 
tion (Ycc & Lopez-Cruz 1999). They use 290 optically- 
selected clusters, of which 40 have significant detec- 
tions in ROSAT /PSPC observations. Using a Bayesian 
maximum-likelihood fitting technique, they find a sig- 
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nificant correlation between Lx (bolomctric) and _B gc , 
with a power-law slope of ~ 1.6 and an intrinsic scat- 
ter of <~ 50%. It must be noted that they introduce a 
prior weighting, Ppi-ior-t^in l) oc l/cinL, which gives more 
weight to models with lower scatter. Again, it is difficult 
to directly compare our measurement of the slope and 
scatter to the Gilbank et al. (2004) measurements, due 
to different richness quantities and selection functions. 

A large intrinsic scatter in the L x -N 2 oo relation can 
account for optically-selected clusters that appear "un- 
derluminous" in X-rays as compared to their optical rich- 
ness (e.g. Bower et al. 1994, 1997; Donahue et al. 2002; 
Ledlow et al. 2003; Basilakos et al. 2004; Gilbank et al. 
2004; Popesso et al. 2007). In particular, Popesso et al. 
(2007) present a set of "Abell X-ray Underluminous" 
(AXU) clusters with masses determined from velocity 
dispersion measurements, that are significantly dimmer 
in X-rays than expected. In Figure 7 the seven AXU clus- 
ters that match to maxBCG (0.1 < z < 0.3, A 200 > 30) 
are denoted with open squares. These arc all significantly 
dimmer than the mean Lx-N 2 m relation, but are fully 
consistent with the observed scatter. Velocity dispersion 
measurements of maxBCG clusters (B07) and compar- 
isons to simulations (Rozo et al. 2007b) have shown that 
there is considerable mass-mixing in a given A 2 oo bin. 
The low-luminosity clusters could be from lower-mass ha- 
los that are picked up as moderate richness in the cluster 
finder. However, Popesso et al. (2007) argue that the 
AXU clusters have low Lx relative to their mass. This 
could be due to biases in masses calculated from veloc- 
ity dispersions with small samples (e.g. Biviano et al. 

2006) , or to large intrinsic scatter in the Lx-M 2 qo rela- 
tion (e.g. Stanek et al. 2006). Each of these possibilities 
(or a combination of the two) are consistent with the 
observations. We further explore the LX-M200 relation 
of maxBCG clusters in a companion letter (Rykoff et al. 

2007) . 

We have reason to believe that the scatter is not fixed 
as a function of richness. For example, B07 find a signifi- 
cantly larger scatter in the velocity dispersion-richness 
(cxp(lncr)~A 2 oo) relation at low richness compared to 
high richness. However, the quality of our data for this 
exercise is not adequate to constrain the change in the 
scatter as a function of richness. Therefore, for the re- 
mainder of the paper we adopt this nominal value_of 
cini = 0.86 ± 0.03 to correct the stacked (mean) Lx 
to obtain the underlying median_Lx values. This essen- 
tially requires us to multiply all Lx values by a factor of 
0.69 ±0.02. 

4.2. Comparison to X-Ray Selected Clusters 

We can compare the Lx~N 200 relation for optically- 
selected maxBCG clusters to that from X-ray selected 
clusters from the literature. For this exercise we chose 
to compare to the NORAS (Bohringer et al. 2000) and 
400 square degree (400d: Burenin et al. 2006) catalogs, 
both constructed from ROSAT data, and thus measured 
in the same energy range as used in this work. The 
NORAS catalog is an X-ray flux limited cluster catalog 
constructed from the RASS photon maps in the north- 
ern sky. Although it is known to be only ~ 50% com- 
plete, it provides a large sample of bright X-ray clusters 
that overlap the maxBCG survey region. The 400d cat- 



alog is a serendipitous X-ray flux limited cluster catalog 
constructed from pointed ROSAT PSPC observations. 
The 400d catalog covers fields sampled from the whole 
sky, and thus only ~ 50% of the catalog overlaps the 
maxBCG survey region. 

Our intention in this exercise is to obtain a baseline 
comparison of the X-ray and optical richness proper- 
ties of X-ray selected clusters to optically-selected clus- 
ters. A somewhat different comparison of NORAS 
and maxBCG, focused on testing the completeness of 
maxBCG, was performed in Koester et al. (2007a). 

To obtain a list of clean matches between NORAS 
or 400d clusters and maxBCG clusters, we require that 
the X-ray position and BCG position be matched within 
250ft~ 1 kpc, and the redshift difference to be less than 
0.05. This ensures both that the probability of a false 
match is <C 1 and that we are not selecting, e.g., merging 
clusters where the X-ray and optical catalogs might uti- 
lize different dcblcnding schemes. There are 89 NORAS 
clusters that match maxBCG, with a median redshift of 
z = 0.18, and 53 400d clusters that match maxBCG, 
with a median redshift of z = 0.16. We convert all X- 
ray luminosities to 0.1-2.4 keV in the rest frame with our 
adopted cosmology. Note that the Lx values obtained in 
NORAS and 400d have been corrected for aperture ef- 
fects. However, with the adopted (i parameters ((3 = 2/3 
for NORAS, and (3 > 0.6 for 400d) these corrections pro- 
duce at most a ~ 10% offset in Lx compared to the 
luminosities obtained in our scaled i?200 apertures. 

Figure 8 shows Lx vs. A200 for the X-ray selected 
clusters that meet our matching criteria. The solid cir- 
cles represent NORAS clusters, and the empty squares 
represent 400d clusters. The luminosities of the 400d 
clusters are typically less than that for NORAS due to 
the deeper flux limit of the pointed ROSAT observations 
relative to the RASS survey. The dashed line shows the 
median Lx-N 200 relation, and the dotted lines show the 
±lcinL scatter constraints. It is not surprising to note 
that X-ray selection picks out primarily the X-ray bright- 
est clusters at a given richness; nearly all the NORAS 
clusters lie above the median maxBCG relation. The 
deeper 400d survey selects a sample more representa- 
tive of the optically-selected clusters, though it is still 
biased high. Comparison to Figure 7 is instructive. At 
every richness, the typical X-ray emission from clusters 
is significantly below what you might expect from X-ray 
selected catalogs. While this is true for both catalogs, 
it is especially apparent for NORAS clusters. Although 
the 400d survey has the sensitivity to detect the richest 
X-ray dim clusters (as in the lower-right corner of Fig- 
ure 7), these clusters are quite rare. The limited overlap 
between the 400d survey region and the maxBCG survey 
region means that it is very unlikely that the 400d survey 
would contain one of these specific maxBCG clusters. 

5. BIASES IN THE L x -N 200 RELATION 
There are a variety of systematic effects which may 
bias the Lx-N 2 oo relation. In this section we address 
six: photometric redshift uncertainty, cluster centering 
errors, richness variation with redshift, BCG luminosity, 
point source contamination, and cool core clusters. 

5.1. Photometric Redshift Uncertainty 
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Fig. 8. — Lx vs. N200 f° r X-ray selected clusters that match 
the maxBCG catalog within 250 h' 1 kpc. The 89 NORAS clusters 
(solid circles) are from a flux-limited survey using RASS, and the 
53 400d clusters (empty squares) are from a deeper flux-limited sur- 
vey using ROSAT /PSPC pointed observations. The dashed line 
shows the median LX-N200 relation, and the dotted lines show 
the ±lcri n _L scatter constraints. As seen in Figure 7 there is signif- 
icant scatter in the LX-N200 relation. The X-ray selected clusters, 
especially from NORAS, tend to sample the brightest tail of the 
distribution at a given richness. 



The maxBCG cluster catalog is based on photomet- 
ric data, and thus we only have photometric redshifts 
("photo-zs") for each of the clusters. While the photo-zs 
are relatively precise (A z < 0.015), we must investigate 
how using photometric redshift estimates might bias our 
stacking results. 

If a cluster is actually closer than its photometric red- 
shift would suggest, it will be over- weighted in the stack- 
ing analysis. When it is more distant than it seems, 
it will be underweightcd. Even if the photo-z errors 
are perfectly symmetric, the weighting factor, w(z) = 
{<Il{z) I dL{z)) 2 , is not. As a result, uncertainty in photo- 
z can introduce a bias in the luminosities. This bias can 
become significant if the photo- z errors are large. The 
net impact of this effect is to make the clusters appear 
more luminous than they truly are. 

There is also a small volume effect caused by photo- z 
errors. At a fixed redshift z there is a larger physical 
volume at a larger distance z + 5z than at a smaller dis- 
tance z — Sz. Because of this, more clusters are avail- 
able at high redshift to scatter low than at low redshift 
to scatter high. This is similar to Malmquist bias. As 
above, this effect will cause us to overestimate the cluster 
luminosities. 

In addition, the photometric redshift is used to cal- 
culate the cluster extraction radius. When a cluster is 
closer than its photo-z implies, then the extraction radius 
used is too small, slightly reducing its implied luminos- 
ity. The opposite is true when the cluster is farther than 
its measured photo-z. As there is very little cluster flux 
at large radii, and as this bias is roughly symmetric, we 
do not expect this effect to strongly bias the stacked Lx ■ 

We run a Monte Carlo simulation to estimate the com- 
bined photo-z bias as a function of richness. Around 37% 
of the BCGs in the maxBCG catalog used in this analysis 



have spectroscopic redshifts from the SDSS DR5 spec- 
troscopic catalog (Strauss et al. 2002; Eisenstein et al. 
2001). With the spectroscopic subsample, we measure 
the photo-z offset (Sz — z p h — z spec ) distribution in each 
richness bin and three redshift bins. A Monte Carlo is 
performed to estimate the overall bias in the measured lu- 
minosity by comparing the "true" luminosities convolved 
with the 5z distribution to the "observed" luminosities 
at the measured photometric redshifts. 

We find that the overall bias is small (< 6% at 99% 
confidence limit), with the observed luminosities slightly 
overestimating the true luminosities. This bias factor 
does not vary with richness. We neglect this correction 
factor due to its small magnitude. 

5.2. Centering Biases 

In our stacking exercise, we chose the BCG as the cen- 
ter of the cluster. For many clusters with individual X- 
ray observations and a bright BCG, this is coincident 
with the X-ray center. However, there are several types 
of clusters in which the BCG is not consistent with the 
center of the X-ray emission. These include merging 
clusters with disturbed morphologies; clusters without 
a dominant central galaxy for unambiguous detection by 
the maxBCG algorithm; and a few (~ 5) previously iden- 
tified strong cooling flow clusters. The central galaxies 
in these clusters host bright AGN and exhibit strong star 
formation, moving their colors off the red sequence and 
excluding them from the maxBCG cluster finding algo- 
rithm. 

We do not expect centering biases to be a significant 
problem for our calculation of the mean X-ray luminosity 
for the following reasons. First, the X-ray luminosity is 
proportional to the electron density p 2 , and thus most 
of the luminosity comes from the central region. With 
our large apertures (> 750/i _1 kpc) as long as the core 
of a cluster is within the aperture, the photon counting 
exercise will include most of the cluster photons. Second, 
we have reason to believe from simulations that most of 
the richer clusters are well centered (J07). This is not 
to say that the cluster centering will not have any effect: 
as shown in Section 3.3, the /3-model parameter is highly 
dependent on the centering distribution. 

To confirm this, we perform some simple tests of de- 
centering the clusters. If we randomly offset all the 
cluster positions with a 2D Gaussian with an rms of 
300 ft- -1 kpc (comparable to the centering distribution 
modeled in J07), the mean Lx values decrease by < 10%, 
and less than la. This rms value was chosen because, as 
is seen in Figure 4, the vast majority of the X-ray cluster 
matches are within 250 ft. -1 kpc. Therefore, we do not 
consider cluster miscentering to contribute a significant 
bias to our X-ray luminosity calculation. 

5.3. Richness Variation with Redshift 

We investigate whether the observed Lx at fixed_ rich- 
ness changes with redshift. Modest evolution in Lx at 
fixed mass is expected. If the clusters are evolving in a 
self-similar manner, then we expect the higher redshift 
clusters to be more luminous than the lower redshift clus- 
ters, due to the fact that the Universe was more dense at 
higher redshift. The expectation is that Lx oc p c (z) 7//6 , 
where p c is the critical density of the Universe at redshift 
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z (Kaiser 1986). Therefore, clusters at a given mass at 
a redshift z = 0.3 should be <~ 30% brighter than similar 
clusters at a redshift of z = 0.1. 

To study possible variation of Lx at fixed richness, 
we split the cluster sample into three redshift bins, con- 
taining the bottom 25%, middle 50%, and top 25% of 
clusters in our sample. The low redshift bin ranges from 
0.10 < z < 0.17 with a median redshift of z = 0.14; 
the middle redshift bin ranges from 0.17 < z < 0.26 
with a median redshift of z = 0.23; and the high red- 
shift bin ranges from 0.26 < z < 0.30 with a median of 
z = 0.28. To obtain increased signal-to-noise in these 
bins, we combine richness bins 41 < A^oo < 50 and 
51 < N 20 o < 70; 26 < N 200 < 32 and 33 < N 200 < 40; 
and 18 < N 2ao < 20 and 21 < N 2ao < 25. These wider 
bins are the same that were used in the lensing analysis 
of S07. 

Figure 9 shows the stacked Lx as a function of N 2 qq for 
the three different richness bins. The dashed line shows 
the mean relation (Equation 1) for all clusters from Sec- 
tion 4. The high redshift clusters (diamonds) are sig- 
nificantly more luminous than the low redshift clusters 
(circles). We parameterize the variation with a factor of 
(1 + z) 7 , and fit all the redshift and richness bins simul- 
taneously with a model of the form: 

L x (< i?20o) = e a 



40 



l + z 



1 



10 42 /r 2 ergss -1 , 



(6) 

where z = 0.23, the median redshift of the cluster cata- 
log. This results in a good fit (\ 2 jv — 18.4/15) with the 
best-fit parameters: a = 3.90 ± 0.04; (3 = 1.85 ± 0.05; 
7 = 6.0 ± 0.8. Note that a and (3 are consistent with 
the mean relation in Equation 1. The redshift variation 
parameter, 7, is quite large, and shows that the high red- 
shift bin (z = 0.28) is almost twice as bright as the low 
redshift bin (z = 0.14). This is significantly in excess of 
the self-similar prediction. The challenge is to determine 
the origin of this redshift dependent shift in Lx at fixed 
richness. 

The photo-z errors described in Section 5.1 may ac- 
count for some of this effect. Although the absolute 
photo-z errors are better than 0.015 at all redshifts, the 
relative photo-z errors are larger at low redshift than 
at high. Therefore, the impact of photo-z bias is larger 
at low redshift than at high redshift. We repeated the 
Monte Carlo simulation described in Section 5.1 to mea- 
sure the photo-z bias from each redshift bin. The addi- 
tional redshift bias is < 10% (99% confidence limit) in 
the low redshift bin, and < 2% (99% confidence limit) in 
the high redshift bin. The photo-z bias therefore remains 
smaller than the observed variation in Lx , although the 
effect is in the same sense as the variation we observe, 
and is a contributing factor. 

A more likely possibility is redshift dependent varia- 
tion of A^oo at fixed mass. If our high redshift clus- 
ters have systematically smaller A^oo at fixed mass, this 
would have the effect of shifting the high redshift points 
in Figure 9 to the left. In order to constrain this bias, 
we must first factor out the effect of the self-similar evo- 
lution of Lx at fixed mass, using the expected redshift 
dependence Lx oc p c (z) 7 ^ 6 as described above. We then 
re-fit the data to Equation 6 to calculate the excess vari- 
ation in the Lx~N 20 o relation that may be attributed 
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Fig. 9. — Mean Lx—Nzoo in three redshift bins, calculated within 
the scaled -R200 apertures. The high redshift bin (diamonds) are 
significantly more luminous than the low richness bin (circles), indi- 
cating significant variation in the LX-N200 relation with richness. 
The dashed line is the stacked mean relation from Equation 1, 
which is consistent with the mean relation for the central redshift 
bin (squares). 



to variation of A^oo at fixed mass. This results in a 
best-fit 7 = 4.5 ± 0.8, implying a fractional decrease in 
N 2m of 30%-40% from our lowest redshift bin (z = 0.14) 
to our highest redshift bin (z = 0.28). This is consis- 
tent with the redshift dependent variation in the veloc- 
ity dispersion-optical richness relation measured by B07. 
However, we must note that redshift dependence in the 
observed lensing shear, AS, is significantly smaller (S07). 
Unfortunately, none of these approaches is yet able to 
confidently determine the nature of the observed varia- 
tion. For example, this effect could be caused by incor- 
rect handling of the 0.4 L* and color cuts that determine 
which galaxies are included in the richness estimate. It 
might also be "true" evolution, such that clusters have 
fewer red-sequence galaxies brighter than 0.4 L* at fixed 
mass at higher redshifts. Further work, including im- 
proving the richness estimates, is in progress to constrain 
the nature of this evolution. 

5.4. BCG Luminosity 

We now investigate the effect BCG i-band luminos- 
ity (Lbcg, see Koester ct al. (2007a) for details) has 
on the mean X-ray luminosity Lx- Simulations and 
semi-analytical modeling predict that dark matter ha- 
los formed at early times have brighter BCGs and lower 
richness than those which form late (Zentner et al. 2005; 
Wechsler et al. 2006; Croton et al. 2007). Thus, we ex- 
pect that the X-ray luminosity should be correlated with 
Lbcg at fixed richness. Furthermore, optical and X-ray 
observations of individual nearby clusters show that more 
luminous BCGs are correlated with more massive halos 
with higher X-ray temperature and luminosity (e.g. Lin 
& Mohr 2004). This trend has already been seen in the 
mean velocity dispersion analysis of maxBCG clusters, as 
the clusters with more luminous BCGs had significantly 
larger velocity dispersions than clusters with similar rich- 
ness (B07). 

In an exercise similar to that performed in the previous 
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Fig. 10. — Mean LX—N200 (within the scaled -R200 apertures) 
split according to BCG luminosity. Each A^oo bin has been sorted 
by the optical luminosity of the BCG (£bcg) of each cluster in 
the bin. The stacking exercise has been performed on the top 
25% (diamonds); middle 50% (squares); and bottom 25% (circles). 
Note that Lbcg is correlated with -/V200, and thus the Lbcg splits 
arc different for each -/V200 bin. The dashed line shows the mean 
relation for all the clusters from Equation 1. The clusters with the 
most luminous BCGs also tend to be the most luminous in X-rays. 
This effect is strongest at low richness, where the BCG is a larger 
fraction of the total optical luminosity of the cluster. 



section, we split each wide richness bin into three bins 
of Lbcg- As Lbcg, unlike redshift, is correlated with 
A2007 we are unable to use the same Lbcg split for each 
-/V200 bin. Therefore, we sort the clusters in each richness 
bin by Lbcg? and split the sample into the top 25%, 
middle 50%, and bottom 25%, and then restack. The 
results are shown in Figure 10. It is readily apparent 
that the clusters with the most luminous BCG in a given 
richness bin (diamonds) arc significantly more luminous 
in X-rays than other clusters in the bin. This effect is 
most dramatic at lower richnesses. This is expected, as 
the BCG is more dominant in low richness clusters than 
high richness clusters (e.g. Lin & Mohr 2004). It is also 
notable that there does not appear to be a significant 
difference between the Lx for the low Lbcg an d middle 
Lbcg bins. 

We parameterize the effect of Lbcg ° n Lx using a 
similar procedure as performed for the redshift variation. 
The richness and Lbcg bins are fit simultaneously with 
a model of the form: 

f 1 ^ d \ a ( N2oo\^ ( Lbcg \ v , n 42 . -2 -1 
L x «R 20Q ) = e [j^J 10 h ergss . 

(7) 

This results in an adequate fit (\ jv = 25.9/15) with 
the best-fit parameters: a = 3.86 ±0.04; [3 = 1.50 ±0.06; 
rj = 0.82 ± 0.10. Although the normalization is the same 
as in Equation 1, the slope is significantly more shal- 
low. This is caused by the correlation between Lbcg 
and A^2oo- After taking this into account, the mean re- 
lation shown here is fully consistent with that calculated 
without splitting the bins by Lbcg • The reason the x 2 of 
the fit is relatively high is because it does not appear that 
Lx is smooth function of Lbcg : the effect of Lbcg is not 
symmetric, and only tends to boost the X-ray luminosity 



of clusters with relatively bright BCGs. 

It is not entirely clear whether this effect is due to an 
imperfect cluster finder, our crude richness definition, or 
due to real cluster physics. Clusters with brighter, more 
dominant BCGs are more likely to be correctly centered 
and thus might be more luminous in X-rays. However, as 
we showed in Section 5.2, the effect of decentered clus- 
ters on Lx is much smaller than the effect we see due 
to Lbcg- Meanwhile, our richness definition, -/V200, is a 
count of red-sequence galaxies brighter than 0.4 L». It 
is thus not surprising that a poor cluster with an excep- 
tionally bright BCG might be more massive and more 
X-ray luminous than a cluster at a similar richness with 
a more typical BCG. Finally, it is possible that these clus- 
ters are associated with halos that formed early, allowing 
many of their member galaxies to merge into a very large 
BCG (Zentner et al. 2005; Wechsler et al. 2006; Croton 
et al. 2007). Deeper targeted X-ray observations of spe- 
cific clusters will be required to determine if we can use 
Lbcg and other optical properties to constrain the age 
of its dark matter halo. 

5.5. Point Source Contamination 

When calculating the projected X-ray luminosity from 
the hot intracluster medium, there is the possibility of 
contamination due to X-ray point sources. Some point 
sources, such as cluster AGN, are associated with the 
cluster, but in general their luminosities are not tightly 
correlated with that of the ICM. Other point sources 
that are chance coincidences, such as foreground stars 
and background quasars, might also boost the apparent 
luminosity. The problem of point source contamination 
is exacerbated by the broad PSF of the RASS survey. 
Due to the large PSF, it is difficult to accurately ex- 
cise point sources from the higher redshift clusters, even 
when their positions are known. For the stacking exer- 
cise, foreground and background sources are not a sig- 
nificant problem. Assuming the positions of these point 
sources are uncorrelated with the cluster positions, the 
contribution from these sources is subtracted with the 
background estimation, as is demonstrated by our stack 
of random points in Figure 1. At the same time, these 
point sources, both those associated with the clusters 
and those that are chance projections, may increase the 
observed scatter. In general, we note that Lx values 
calculated from RASS arc not corrected for point source 
contamination (e.g. Bohringer et al. 2000, 2004). We 
wish to perform a first-order check on possible contami- 
nation from cluster AGN, the_dominating point sources 
that might bias the observed Lx- 

Martini et al. (2007) have recently performed a detailed 
survey of the distribution of AGN in galaxy clusters for 
eight moderate redshift clusters, comparable to the red- 
shift range of the maxBCG catalog. They find that mod- 
erately bright AGN (Lx > 10 42 ergs _1 ) associated with 
bright cluster members (> 0.5 L*) make up around ~ 1% 
of the total cluster member population. This implies that 
many of the richest maxBCG clusters contain at least 
one moderately bright_AGN. These clusters have an ex- 
tremely bright ICM (Lx ~ 10 44 h~ 2 ergs s _1 ), and thus 
the fraction of the luminosity from cluster AGN will, on 
average, be very small. If we extrapolate the same AGN 
fraction to our poorest clusters (N 2 oo ~ 10, perhaps one 
in ten will host a moderately bright AGN. Although these 
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Table 5. Radio Source maxBCG Member Matches 



Richness Range 


■ — 

Clusters w/ 
Radio Sources 


— — — 

% Clusters w/ 

Radio Sources 


71 < 7V 2 oo < 188 


45 


82% 


51 < N 200 < 70 


91 


65% 


41 < A^oo < 50 


125 


62% 


33 < N 200 < 40 


183 


54% 


26 < A^oo < 32 


284 


45% 


21 < N 200 < 25 


420 


37% 


18 < A^oo < 20 


332 


30% 


12 < AT 20 o < 17 


1330 


25% 


9 < JV200 < 11 


1341 


18% 



AGN might have a luminosity comparable to the ICM for 
these 10% of poor clusters, this small fraction should not 
strongly bias the stacked average. 

As a simple check for contamination in the X-ray signal 
from cluster AGN, we match maxBCG cluster member 
galaxies to radio sources from the FIRST survey (White 
et al. 1997) with a match radius of 3". By matching 
random galaxies from SDSS DR5 to the FIRST sources, 
we estimate the purity of the matches to be > 99%. The 
FIRST survey covers approximately the same footprint 
as the SDSS, down to a typical flux limit of lmJy at 
1.4 GHz. Using 1.4 GHz radio detections as a proxy for 
AGN activity has the advantage that the radio spectrum 
is often flat, and thus the 1.4 GHz flux is not a strong 
function of redshift. The disadvantages are that only 
10% of X-ray bright quasars and AGN are radio loud, 
and that there is a large scatter between X-ray and radio 
luminosities of these objects. For example, Martini et al. 
(2007) noted that for the same small set of clusters none 
of the X-ray selected AGN are radio loud Morrison et al. 
(2003). 

Although we have an incomplete selection function, we 
can still constrain the contamination from cluster AGN. 
If the mean Lx is significantly boosted by cluster AGN, 
then we would expect the clusters which match radio 
sources to be relatively bright. This effect would be 
greatest at low richness and low Lx when the ICM is 
not as hot or bright. For each richness bin, we matched 
the maxBCG cluster members within 750 h~ x kpc of the 
BCG to sources in the FIRST survey. Table 5 shows 
the number of clusters in each bin that have members 
matched with the FIRST catalog. More detailed work 
is in progress in cross-correlating the maxBCG catalog 
with radio sources to explore cluster AGN feedback and 
related issues (see also Croft et al. (2007)). 

We wish to compare how the radio selected subset of 
clusters compares to a randomly selected subset of clus- 
ters of similar richness. In each richness bin, we ran 
10000 trials by sampling (with replacement) from the en- 
tire set of clusters in the bin, sampling the same number 
of clusters that match radio sources. As a quick esti- 
mate of the mean luminosity, rather than performing the 
full stacking analysis and spectral fits, we calculated the 
stacked and weighted, background-subtracted hard chan- 
nel (0.5-2.0 keV) counts in a fixed 750 h" 1 kpc aperture. 
The resulting count rate was converted to Lx using the 
method described in Section 4.1. We can then compare 
the mean Lx of the clusters which match radio sources 
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Fig. 11. — Histograms of the distribution of Lx values obtained 
from resampling the same number of clusters which match FIRST 
radio sources (as a proxy for AGN contamination) in each bin. The 
vertical dashed lines indicate Lx for the clusters which match radio 
sources. The clusters with radio matches are not significantly more 
luminous than those drawn from the distribution, except for the 
12 < A r 200 < 17 bin (discussed in the text). This would indicate 
that, on average, AGN emission is not biasing our stacked Lx 
values. 



to the typical values of Lx that we would expect when 
drawing that same number of clusters from the entire 
sample. The results are shown in Figure ll._ In each 
panel the histogram shows the distribution of Lx values 
obtained from the resampling, and the vertical dashed 
line marks the mean Lx of the radio-matched clusters. 
We do not see any significant bias as a function of rich- 
ness. The only bin in which the radio matches appear to 
be outliers is the 12 < A^oo < 17 richness bin. This is due 
to random chance, in that the clusters in this richness bin 
with the highest Lx also happen to match radio sources; 
although there is no obvious indication in the RASS data 
that there is point source contamination on these clus- 
ters, further follow-up with Chandra or XMM/Newton 
would be required to clarify this. 

5.6. Cool Core Clusters 

There is much evidence that cool core clusters increase 
the scatter in X-ray cluster properties. By excluding 
cluster cores in high resolution imaging, many X-ray pa- 
rameters become more tightly correlated (e.g. O'Hara 
et al. 2006; Chen et al. 2007; Maughan 2007). The 
broad ROSAT PSF, combined with the fact that most 
of our clusters do not have individual X-ray detections, 
makes it impossible to exclude cluster cores in our stack- 
ing analysis. However, a few of the brightest clusters 
(> 10 44 h~ 2 ergss -1 ) at moderate richness (A200 ~ 45) 
are well known cool-core clusters. We thus investigate 
what effect cool core clusters have onjDur estimate of the 
median Lx and scatter o\ n l in the LX-N200 relation. 

There does not exist an unbiased volume-limited cat- 
alog of cool core X-ray clusters which can be fairly com- 
pared to the maxBCG catalog. Peres et al. (1998) used 
ROSAT pointed observations of an X-ray flux limited 
catalog (Edge et al. 1990) to estimate the central cool- 
ing time of 55 nearby clusters. Two of these clusters 
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are in the maxBCG catalog (A1689, A2244) and exhibit 
characteristics of a cool core (t coo i < lOGyr). Bauer 
et al. (2005) used Chandra observations of the higher 
rcdshift (z > 0.15) BCS clusters to systematically esti- 
mate the central cooling time of bright X-ray clusters. 
We identify 7 maxBCG clusters (A750, A963, A1835, 
RXJ 2129.6+0005, Z2701, Z3146, Z7160) from this sam- 
ple with cooling times t coo i < 10 Gyr which we mark 
as cool core clusters. Finally, we identify C1G J1504- 
0248 (Bohringer et al. 2005) as maxBCG cluster with a 
cool core. It should be noted that most of these clusters 
have a moderate richness (30 < -/V200 < 50) and are in 
the brightest tail of the Lx distribution for their richness. 
Furthermore, the central galaxies of these clusters tend 
to show strong Ha emission in SDSS spectroscopy. In 
fact, for a few of these clusters the true BCG is not in the 
list of cluster members because the strong Ha emission 
changes the apparent galaxy color, making it inconsis- 
tent with the red-sequence used in the cluster finding 
algorithm. 

After excluding these known cool core clusters (which 

might bias the LX-N200 relation) we recalculate the 
stacked mean relationship as well as the median rela- 
tionship with scatter. When calculating the median re- 
lationship with scatter, as in Section 4.1, the normaliza- 
tion and slope shift by <C la, as these values are not 
strongly affected by outliers. However, the intrinsic ob- 
served scatter decreases to a\ a L = 0.77 ± 0.03, which is 
a <~ 2a shift. It is not surprising that a\ a L decreases: we 
are deliberately removing the brightest clusters. Mean- 
while, the stacked relation is also slightly affected by 
these bright outliers, and after cutting the known cool 
core clusters the normalization of the stacked relation de- 
creases by ~ l<7. After following the previously described 
prescription for converting the stacked mean relation to a 
scatter-corrected median relation, we find that removing 
the known cool core clusters decreases the normalization 
by < 5%. Therefore, it does not appear that the known, 
very X-ray luminous, cool core clusters, are significantly 
biasing our relation. We reiterate that the intrinsic scat- 
ter we measure is the observed scatter that takes into 
account all X-ray observations of the maxBCG clusters, 
regardless of X-ray morphology of the clusters. Thus, 
it is likely that some portion of this observed scatter is 
caused by unidentified cool core clusters in the sample. 

6. THE LUMINOSITY-VELOCITY DISPERSION RELATION 

(Lx-Z) 

B07 have measured the median velocity dispersion 
(<7 = cxp(lner)) as a function of A200 for the maxBCG 
catalog. The richness bins used to measure the stacked 
velocity dispersions, as well as the scatter in the a— 
A200 relation, are the same as those used in this work. 
However, in order to measure velocity dispersions, only 
clusters that have the BCG and at least one additional 
member galaxy in the SDSS DR5 spectroscopic subsam- 
ple (Strauss et al. 2002; Eiscnstcin et al. 2001) were 
used. The spectroscopic subsamplc preferentially selects 
bright/nearby galaxies, and thus the selection of clusters 
used in B07 (z = 0.16, with z > 0.05) is slightly differ- 
ent than that for the full maxBCG catalog (z = 0.23, 
with z > 0.1). As discussed in Section 5.3, the lower red- 
shift clusters have a smaller Lx than the higher redshift 
clusters. We therefore re-run our stacking procedure on 



Table 6. L x and a for Spectroscopic Subsample 



Richness Range 


W200 


-R200 
(h- 1 Mpc) 


Lx 

(10 42 h-' 2 ergs s" 1 ) 


a 

km s~ 1 


71 < JV200 < 188 


83.9 


1.69 


131 ± 28 


854 ± 103 


51 < 7V 2 oo < 70 


58.4 


1.45 


86.6 ±11.4 


645 ± 58 


41 < 7V 2 oo < 50 


44.7 


1.32 


57.2 ±6.9 


645 ± 58 


33 < 7V 2 oo < 40 


35.9 


1.22 


40.3 ±4.9 


539 ± 38 


26 < 7V 2 oo < 32 


28.6 


1.10 


24.5 ±2.5 


518 ± 26 


21 < 7V 2 oo < 25 


22.7 


1.02 


15.2 ±2.8 


441 ± 22 


18 < 7V 2 oo < 20 


19.0 


0.94 


9.05 ± 1.39 


459 ± 60 


12 < AT 20 o < 17 


14.0 


0.82 


6.95 ± 0.84 


384 ± 15 


9 < A^oo < 11 


9.9 


0.73 


2.94 ± 0.62 


311 ± 12 




300 _ 1 000 

cj (km s" 1 ) 



Fig. 12. — Median L x vs. a for the maxBCG clusters. L x and 
cr for each stacked data point (solid circles) is measured in a given 
richness bin using the cluster selection from B07. The gray band 
represents the best-fit relation (±l<x), where L x oc g? 4 -30±0.43^ 
consistent with self-similar evolution. The empty circles show Lx 
and a values for individual X-ray selected REFLEX clusters as 
measured in the 2df survey (Hilton et al. 2005), corrected to the 
cluster rest frame. The fictitious data point in the legend in the 
upper left shows the typical errors on these values. The individual 
X-ray selected clusters have slightly higher (~ la) velocity disper- 
sions at a given X-ray luminosity, as compared to the maxBCG 
stacked measurements. 



exactly those clusters used in B07, and use the known 
spectroscopic redshifts of the BCGs rather than the pho- 
tometric redshifts. This ensures that, for this exercise 
at least, we are not affected by any possible photometric 
rcdshift biases, which otherwise could be significant for 
the nearest clusters (z < 0.1) where the relative photo-z 
error (A z /z > 15%) is quite large. Table 6 shows the 
richness bins used, as well as the mean Lx (this work) 
and median a (from B07) for the nine richness bins. The 
slope of the Lx-N 2 oo relation is nearly identical for the 
spectroscopic subsample as reported in Section 4, but 
the normalization is lower by 10%, due to the different 
rcdshift selection. 

We take the mean Lx values and corrected them for 
scatter as described in Section 4.1 to obtain Lx- Fig- 
ure 12 shows the median Lx~a relation for maxBCG 
clusters with richness -/V200 > 9. Each data point (solid 
circles) is obtained from the same richness bin and same 
cluster selection. The dark gray band shows the best-fit 



X-ray properties of maxBCG clusters 



19 



relation (±ler) from clusters in the spectroscopic subsam- 
plc of the maxBCG catalog, with the following functional 
form: 



Lx{< -R200) = e 



2.73±0.11 



a 



500 km s" 



4.30±0.43 



h- 2 



crgss 
(8) 

The scaling relation predicted by self-similar cluster evo- 
lution is that Lx ex T 2 cx er 4 (e.g. Kaiser 1986; Marke- 
vitch 1998), which is consistent with our observations. 
We note that if we could convert our 0.1-2.4 keV lumi- 
nosities to Lx.ho\o, this would steepen our relation, as 
the clusters with higher velocity dispersions also have 
a higher temperature and larger Lx,bo\o/Lx, 0.1-2.4 ra- 
tio. As mentioned previously, such a conversion is not 
practical for the stacking exercise using ROSAT data. 
The slope of our 0.1-2.4 keV Lx~a relation is consistent 
with that measured for REFLEX clusters (Ortiz-Gil et al. 
2004; Hilton et al. 2005). 

We compare this median relation to that obtained pre- 
viously for individual X-ray selected clusters. Hilton 
et al. (2005) calculated the velocity dispersions of RE- 
FLEX clusters by cross-correlating the REFLEX catalog 
with the Two-degree Field Galaxy Rcdshift Survey (2dF; 
Colless et al. 2001). Figure 12 shows 39 individual RE- 
FLEX clusters (empty circles) from Hilton, et al. (2007, 
private communication) after converting the velocity dis- 
persions to the cluster rest frame. In the upper-left cor- 
ner the empty circle shows the typical errors in 0.1-2.4 
keV L x and' a for the REFLEX/2dF clusters. Hilton 
et al. (2005) found the slope of the X-ray selected Lx~ 
a relation to be 4.0 ± 0.6, consistent with our observa- 
tions. However, the individual X-ray selected clusters 
tend to have a slightly higher (~ la) velocity disper- 
sion at a given X-ray luminosity compared to the stacked 
maxBCG clusters. This is possibly caused by the dif- 
ficulty in measuring velocity dispersions for individual 
clusters. Contamination from non-virialized galaxies and 
interlopers will more likely result in an overestimate of 
the velocity dispersion rather than an underestimate. 

7. SUMMARY 

By stacking observations from the ROSAT All-Sky 
Survey, we measure the mean X-ray luminosity as a 
function of richness for optically-selected clusters from 
the maxBCG catalog. With a large number of clus- 
ters in each bin, the stacking exercise has the power to 
probe to much lower flux limits than are possible for X- 
ray selected surveys. Thus, a highly pure and complete 
volume-limited catalog with clusters selected by their op- 
tical properties can be used to measure the mean Lx ■ 

We find that Lx scales with optical richness, N200, 
with a simple power-law form over two orders of magni- 
tude in Lx- Our results are similar to those obtained in 
DKM07 by stacking RASS observations of clusters iden- 
tified in 2MASS. However, we find that stacked tempera- 
ture measurements from ROSAT are significantly biased 
for higher temperature clusters. Therefore, we restrict 
our analysis to luminosities calculated in the 0.1-2.4 keV 
band rather than extrapolating to calculate bolometric 
luminosities. Furthermore, we have shown that the (3- 
model fits to the stacked radial profiles of moderate red- 
shift clusters are dominated by a combination of the 
broad ROSAT PSF and the offset distribution between 



maxBCG clusters and correlated X-ray sources. On the 
other hand, calculating X-ray luminosity in the 0.1-2.4 
keV band, which relies primarily on counting photons, is 
not strongly biased by spectral fits or profile fits. As Lx 
- 1 scales with the square of the density, as long as the core 
of a cluster is within the stacking aperture, we can obtain 
a high reliability estimate of the mean cluster luminosity. 

By measuring the X-ray flux at the positions of indi- 
vidual maxBCG clusters, we are able to constrain the 
observed scatter in the Lx-N 2 oo relation. Assuming a 
log-normal distribution of Lx as a function of N 2 oo, we 
find oinL = 0.86 ± 0.03. This scatter is quite large, and 
thus it is necessary to correct for the scatter when cal- 
culating the median of the underlying distribution Lx- 
The richness measure we have used in this work, N20O1 is 
simply the count of red-sequence galaxies within a scaled 
-R200 aperture, but nevertheless is a good proxy for Lx 
on average. It is likely that improved richness measure- 
ments can be made. From the evidence presented here, 
it seems likely that these new richness estimators will in- 
clude information about BCG luminosity, and will more 
carefully control variations in cluster membership with 
rcdshift. 

The large scatter in the Lx-richness relation does have 
some significant effects on cluster selection. First, X-ray 
selection with a high flux limit, as with the NORAS cat- 
alog, tends to pick out the clusters with the highest Lx 
at a given richness. Second, the large scatter in the Lx~ 
N200 relation means that there are a significant number 
of optically selected clusters that are seemingly "under- 
luminous" . Optically selected clusters will often appear 
relatively dim compared to X-ray selected clusters at a 
similar richness. 

The goal of cluster selection and identification is to 
find a low scatter proxy for halo mass. Both X-ray 
and optical selection techniques suffer from different lim- 
itations. X-ray flux limited surveys can only find the 
brightest X-ray clusters, and are thus limited to a com- 
bination of the brightest and nearest clusters. Further- 
more, there may be significant scatter between X-ray lu- 
minosity and halo mass (Stanek et al. 2006; Nord et al. 
2007). Optical surveys such as maxBCG may be vol- 
ume limited out to moderate redshift, but optical rich- 
ness also has a large scatter with respect to X-ray lumi- 
nosity and mass (B07). The development of more precise 
mass proxies, including Tx, the Compton Y parameter 
(Nagai 2006), Yx (Kravtsov et al. 2006), and potentially 
more sophisticated optical richness estimates may allow 
selection of cluster catalogs more closely approximating 
the mass-limited catalog we aspire to. 

By combining our present analysis with the velocity 
dispersion measurements of B07, we have measured the 
median Lx~a relation for the maxBCG clusters. The 
relation has a slope of 4.30 ± 0.43, consistent with the 
prediction of self-similar cluster evolution. Previous mea- 
surements of the Lx~a relation have been complicated by 
both selection effects and a large scatter, as the derived 
slope depends strongly on the fitting technique used. By 
stacking clusters from a volume-limited catalog and cor- 
recting for scatter, both Lx and a are much better con- 
strained, allowing a robust calculation of the relation. 
In our analysis, we do not see any evidence of a break 
in the Lx~a relation at the poor cluster (group) scale, 
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as has been hinted at previously (e.g. Xue & Wu 2000). 
This might be due to the fact that the maxBCG clus- 
ter finder works with red-sequence clusters, and does not 
find groups with a large blue galaxy fraction. 

We can also compare Lx as determined from this 
method to weak lensing masses determined from the 
same maxBCG cluster catalog (S07, J07). Determin- 
ing the LX—M200 relation is not only important for un- 
derstanding cluster physics, but for calibrating the se- 
lection function of X-ray clusters used as cosmological 
probes (e.g. Stanek et al. 2006). We have made this 
measurement in a companion letter (Rykoff et al. 2007). 
In addition, the comparison of the X-ray luminosities, 
weak lensing masses, and velocity dispersions can be used 
in conjunction with the number function of maxBCG 
clusters to constrain cosmological parameters (e.g. Rozo 
et al. 2007b). This work is ongoing. 

Refined measurements of the scatter, point source con- 
tamination, and confirmation of catalog purity will re- 
quire deeper pointed X-ray observations. We are cur- 
rently investigating serendipitous maxBCG cluster ob- 
servations by deeper pointed i?0,S^4T/PSPC observa- 
tions, as well as XMM/Newton and Chandra Plionis 
et al. (e.g. 2005). Unfortunately, most of the serendipi- 
tous cluster observations are of relatively poor clusters, 
which are much more numerous. A targeted campaign of 
a representative sample of maxBCG clusters that have 
not previously been known to be X-ray bright will be 
essential to both test our measurement of the scatter in 
LX-N200 and to estimate the fraction of cluster flux that 
is contaminated by point sources. This will be essential 
to enable future inexpensive deep optical cluster surveys 
such as DES (Abbot et al. 2005) to use optical proper- 
ties of clusters to estimate which clusters would be most 
useful to follow-up with targeted X-ray observations. 
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